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Abstract. In this paper, we present a systematic way of deriving (1) languages of (gen- 
eralised) regular expressions, and (2) sound and complete axiomatizations thereof, for a 
wide variety of systems. This generalizes both the results of Kleene (on regular languages 
and deterministic finite automata) and Milner (on regular behaviours and finite labelled 
transition systems), and includes many other systems such as Mealy and Moore machines. 



1. Introduction 

In a previous paper [9], we presented a language to describe the behaviour of Mealy 
machines and a sound and complete axiomatization thereof. The defined language and 
axiomatization can be seen as the analogue of classical regular expressions [21] and Kleene 
algebra [22], for deterministic finite automata (DFA), or the process algebra and axiomati- 
zation for labelled transition systems (LTS) [28]. 

We now extend the previous approach and devise a framework wherein languages and 
axiomatizations can be uniformly derived for a large class of systems, including DFA, LTS 
and Mealy machines, which we will model as coalgebras. 

Coalgebras provide a general framework for the study of dynamical systems such as 
DFA, Mealy machines and LTS. For a functor Q : Set — )• Set, a ^-coalgebra or ^-system 
is a pair {S,g), consisting of a set S of states and a function g: S ^ GiS) defining the 
"transitions" of the states. We call the functor Q the type of the system. For instance, 
DFA can be modelled as coalgebras of the functor Q{S) = 2 x S , Mealy machines are 
obtained by taking G{S) = [B x S) and image-finite LTS are coalgebras for the functor 
G{S) = (TZiS))^, where IZ is finite powerset. 
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Under mild conditions, functors G have a final coalgebra (unique up to isomorphism) into 
which every ^-coalgebra can be mapped via a unique so-called Q-homomorphism. The final 
coalgebra can be viewed as the universe of all possible Q -behaviours: the unique homomor- 
phism into the final coalgebra maps every state of a coalgebra to a canonical representative 
of its behaviour. This provides a general notion of behavioural equivalence: two states 
are equivalent if and only if they are mapped to the same element of the final coalgebra. 
Instantiating the notion of final coalgebra for the aforementioned examples, the result is 
as expected: for DFA the final coalgebra is the set 2 of all languages over A] for Mealy 
machines it is the set of causal functions /: A^ — )• B'^; and for LTS it is the set of finitely 
branching trees with arcs labelled by a S A modulo bisimilarity. The notion of equivalence 
also specializes to the familiar notions: for DFA, two states are equivalent when they ac- 
cept the same language; for Mealy machines, if they realize (or compute) the same causal 
function; and for LTS if they are bisimilar. 

It is the main aim of this paper to show how the type of a system, given by the functor 
Q, is not only enough to determine a notion of behaviour and behavioural equivalence, 
but also allows for a uniform derivation of both a set of expressions describing behaviour 
and a corresponding axiomatization. The theory of universal coalgebra [31] provides a 
standard equivalence and a universal domain of behaviours, uniquely based on the functor 
Q. The main contributions of this paper are (1) the definition of a set of expressions Expg 
describing ^-behaviours, (2) the proof of the correspondence between behaviours described 
by Expg and locally finite ^-coalgebras (this is the analogue of Kleene's theorem), and (3) 
a corresponding sound and complete axiomatization, with respect to bisimulation, of Expg 
(this is the analogue of Kleene algebra). All these results are solely based on the type of 
the system, given by the functor Q. 

In a nutshell, we combine the work of Kleene with coalgebra, considering the class 
of non-deterministic functors. Hence, the title of the paper: non-deterministic Kleene 
coalgebras. 

Organization of the paper. In Section [2] we introduce the class of non-deterministic 
functors and coalgebras. In Section [3] we associate with each non-deterministic functor Q 
a generalized language Expg of regular expressions and we present an analogue of Kleene's 
theorem, which makes precise the connection between Expg and ^-coalgebras. A sound 
and complete axiomatization of Expg is presented in Section [H Section [5] contains two more 
examples of application of the framework and Section[6]shows a language and axiomatization 
for the class of polynomial and finitary coalgebras. Section [7] presents concluding remarks, 
directions for future work and discusses related work. This paper is an extended version 
of im ilOj: it includes all the proofs, more examples and explanations, new material about 
polynomial and finitary functors and an extended discussion section. 

2. Preliminaries 

We give the basic definitions on non-deterministic functors and coalgebras and introduce 
the notion of bisimulation. 

First we fix notation on sets and operations on them. Let Set be the category of sets 
and functions. Sets are denoted by capital letters X, y, . . . and functions by lower case 
f,g,.... We write for the empty set and the collection of all finite subsets of a set X is 
defined as Vj{X) = {Y C X \ Y finite}. The collection of functions from a set X to a set 
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Y is denoted by Y . We write idx for the identity function on set X. Given functions 
/ : X — )• y and g : Y ^ Z we write their composition as g o f. The product of two 

sets X, Y is written as X x Y, with projection functions X < X x Y > Y .The set 

1 is a singleton set typicaUy written as 1 = {*} and it can be regarded as the empty 
product. We define X ^ Y as the set X tt) Y tt) {_L,T}, where tt) is the disjoint union of 

sets, with injections X > X ^Y < Y ■ Note that the set X '^Y is different from 

the classical coproduct of X and Y (which we shall denote hy X + Y), because of the two 
extra elements _L and T. These extra elements will later be used to represent, respectively, 
underspecification and inconsistency in the specification of some systems. The intuition 
behind the need of these extra elements will become clear when we present our language 
of expressions and concrete examples, in Section 13.3. H of systems whose type involves ^. 
Note that X^X^2x X^X + X. 

For each of the operations defined above on sets, there are analogous ones on functions. 
Let /: X— T-y, /i: X— )-y and /2 : Z ^ W. We define the following operations: 

fix f2: X X Z ^Y xW fi<^ f2: X^Z ^Y^W 

{hxh){{x,z)) = {h{x)j2{z)) (f,^f2){c) = C, CG{±,T} 

{fi^f2){'^i{x)) = >^i{Mx)), ie{i,2} 

f^{9) = fog nf){S) = {f{x)\xGS} 

Note that here we are using the same symbols that we defined above for the operations on 
sets. It will always be clear from the context which operation is being used. 

In our definition of non-deterministic functors we will use constant sets equipped with an 
information order. In particular, we will use join-semilattices. A (bounded) join-semilattice 
is a set B equipped with a binary operation Vb and a constant _Lb S B, such that Vb is 
commutative, associative and idempotent. The element _Lb is neutral with respect to Vb. 
As usual, Vb gives rise to a partial ordering <b on the elements of B: 

6i <B ^2 ^ ^1 Vb ^2 = h 

Every set S can be mapped into a join-semilattice by taking B to be the set of all finite 
subsets of S with union as join. 

Non-deterministic functors. Non-deterministic functors are functors Q: Set — )■ Set, 
built inductively from the identity and constants, using x, ^, (— ) and IZ- 

Definition 2.1. The class NDF of non- deterministic functors on Set is inductively defined 
by putting: 

NDF Bg:: = \d\B\g^g\gxg\g^\T^g 

where B is a finite (non-empty) join-semilattice and ^4 is a finite set. Jft 

Since we only consider finite exponents A = {ai, . . . , a„}, the functor (— ) is not really 
needed, since it is subsumed by a product with n components. However, to simplify the 
presentation, we decided to include it. 
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Next, we show the exphcit definition of the functors above on a set X and on a morphism 
f:X^Y (note that g{f): g{X) -^ g{Y)). 

\A{x)=x B(x) = B {Qi^Q2){x) = gi{x)^g2{x) 

ld(/) = / B(/) = zdB {Gl^Q2){f) = Gl{f)^G2U) 

{gA){x) = q{x)^ {UQ){x)=u{g{x)) {Qi>iQ2){x) = gi{x)xg2{x) 
{Q^){f) = Gif)^ {V.gm = VUGif)) [Gi X G2){f) = Giif) x G2{f) 

Typical examples of non-deterministic functors include M = (Bx Id), T> = 2x Id , 
Q = (1 ^ Id)"^ and J\f = 2 x (^Id)^, where 2 = {0, 1} is a two-element join semilattice 
with as bottom element (1 V = 1) and 1 = {*} is a one element join-semilattice. These 
functors represent, respectively, the type of Mealy, deterministic, partial deterministic and 
non-deterministic automata. In this paper, we will use the last three as running examples. 
In [9], we have studied in detail regular expressions for Mealy automata. Similarly to 
what happened there, we impose a join-semilattice structure on the constant functor. The 
product, exponentiation and powerset functors preserve the join-semilattice structure and 
thus do not need to be changed. This is not the case for the classical coproduct and thus 
we use ^ instead, which also guarantees that the join semilattice structure is preserved. 

Next, we give the definition of the ingredient relation, which relates a non-deterministic 
functor G with its ingredients, i.e. the functors used in its inductive construction. We shall 
use this relation later for typing our expressions. 

Definition 2.2. Let < C NDF x NDF be the least reflexive and transitive relation on non- 
deterministic functors such that 



Gi<GixG2, G2<Giy<G2, Gi<iGi^G2, G2<iGi^G2, G-^G"^, G <\1ZG 



* 



Here and throughout this document we use J- <\G ds, a, shorthand for {J-,G) G <l. If 
J- < G, then T is said to be an ingredient of G. For example, 2, Id, Id and V itself are all 
the ingredients of the deterministic automata functor D = 2 x Id . 

Non-deterministic coalgebras. A non-deterministic coalgebra is a pair {S, f : 5 — )• G{S)), 
where S* is a set of states and ^ is a non-deterministic functor. The functor G, together with 
the function /, determines the transition structure (or dynamics) of the ^-coalgebra |31j . 
Mealy, deterministic, partial deterministic and non-deterministic automata are, respectively, 
coalgebras for the functors A^ = (B x Id)^, V = 2x Id^, Q = (1^ Id)^ and AA = 2 x (:Sld)^. 
A G -honioniorphism from a ^-coalgebra {S,f) to a ^-coalgebra {T,g) is a function 
h: S ^- T preserving the transition structure, i.e. such that g oh = G{h) o /. 

Definition 2.3. A ^-coalgebra (17, w) is said to be final if for any ^-coalgebra (5, /) there 
exists a unique ^-homomorphism beh^: S" — t- il. 4k 

For every non-deterministic functor G there exists a final ^-coalgebra (ilg,a;g) [31j. For 
instance, as we already mentioned in the introduction, the final coalgebra for the functor T) 
is the set of languages 2 over A, together with a transition function d : 2 — t- 2 x (2 ) 
defined as d{(p) = {(f){e) , \aXw .(j){aw)) . Here e denotes the empty sequence and aw denotes 
the word resulting from prefixing w with the letter a. The notion of finality will play a key 
role later in providing a semantics to expressions. 

Given a ^/-coalgebra (S*, /) and a subset F of 5 with inclusion map i: V ^ S wq say 
that y is a subcoalgebra of S if there exists g: V ^ GiV) such that z is a homomorphism. 
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Given s G S, (s) = (T, t), denotes the smallest subcoalgebra generated by s, with T given 
by 

T = \\{V I y is a subcoalgebra of S and s G V} 

If the functor J^ preserves arbitrary intersections, then the subcoalgebra (s) exists. This 
will be the case for every functor considered in this paper. Moreover, all the functors we 
will consider preserve monos and thus the transition structure t is unique 0T, Proposition 
6.1]. 

We will write Coalg{Q) for the category of ^-coalgebras together with coalgebra homo- 
morphisms. We also write Coalgi^YiQ) fo^ the category of ^-coalgebras that are locally finite. 
Objects are ^-coalgebras {S, /) such that for each state s £ S the generated subcoalgebra 
(s) is finite. Maps are the usual homomorphisms of coalgebras. 

Let {S,f) and {T,g) be two ^-coalgebras. We call a relation R C S x T a bisimula- 
tion Hg iff 

_ {s,t)€R^{f{s),g{t))Gg{R) 

where g{R) is defined as g{R) = {{Q{tti){x),Q{tt2){x)) \ x G Q{R)}. We write s ~g t 
whenever there exists a bisiniulation relation containing (s, t) and we call ~g the bisimilarity 
relation. We shall drop the subscript G whenever the functor G is clear from the context. 
For all non-deterministic ^-coalgebras {S, f) and (T, g) and s £ S,t £ T, it holds that 
s ~ t <^=^ beh5(s) = behr(t) (the left to right implication always holds, whereas the 
right to left implication only holds for certain classes of functors, which include the ones we 
consider in this paper [31]|35])- 



3. A LANGUAGE OF EXPRESSIONS FOR NON-DETERMINISTIC COALGEBRAS 

In this section, we generalize the classical notion of regular expressions to non-deter- 
ministic coalgebras. We start by introducing an untyped language of expressions and then 
we single out the well-typed ones via an appropriate typing system, thereby associating 
expressions to non-deterministic functors. 

Definition 3.1 (Expressions). Let ^ be a finite set, B a finite join-semilattice and X a set 
of fixed point variables. The set Exp of all expressions is given by the following grammar, 
where a G A, b £ B and x £ X: 

e ::= 0|x|e©e| //x.7 | b \ /(e) | r(e) \l[e\\ r[e\ \ a(e) | {e} 

where 7 is a guarded expression given by: 

7 :: = I 7 © 7 I ^x.7 I b I l{e) \ r{e) \ l[e] \ r[e] \ a{e) \ {e} 

The only difference between the BNF of 7 and e is the occurrence of x. J(t 

In the expression jUX.7, /i is a binder for all the free occurrences of x in 7. Variables 
that are not bound are free. A closed expression is an expression without free occurrences 
of fixed point variables x. We denote the set of closed expressions by Exp'^. 

Intuitively, expressions denote elements of the final coalgebra. The expressions 0, ei 
£2 and fix. £ will play a similar role to, respectively, the empty language, the union of 
languages and the Kleene star in classical regular expressions for deterministic automata. 
The expressions /(e) and r(e) refer to the left and right hand-side of products. Similarly, l[e] 
and r[e] refer to the left and right hand-side of sums. The expressions a(e) and {e} denote 
function application and a singleton set, respectively. We shall soon illustrate, by means 
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of examples, the role of these expressions. Here, it is already visible that our approach 
(to define a language) for the powerset functor differs from classical modal logic where D 
and are used. This is a choice, justified by the fact that our goal is to have a "process 
algebra" like language instead of a modal logic one. It also explains why we only consider 
finite powerset: every finite set can be written as the finite union of its singletons. 

Our language does not have any operator denoting intersection or complement (it only 
includes the sum operator ©). This is a natural restriction, very much in the spirit of 
Kleene's regular expressions for deterministic finite automata. We will prove that this 
simple language is expressive enough to denote exactly all locally finite coalgebras. 

Next, we present a typing assignment system for associating expressions to non-de- 
terministic functors. This will allow us to associate with each functor Q the expressions 
e S Exp'^ that are valid specifications of ^-coalgebras. The typing proceeds following the 
structure of the expressions and the ingredients of the functors. 

Definition 3.2 (Type system). We define a typing relation hC Exp x NDF x NDF that will 
associate an expression e with two non-deterministic functors J-" and G, which are related by 
the ingredient relation (J^ is an ingredient of G). We shall write h e: J- <\G for (e, J^, G) G l~. 
The rules that define h are the following: 





h0: 


F<\g 


Kei 


■.F<g 


h £2 : -^ < ^ 




^ ei®e2- T <Q 




he: 


:Fi<\g 



h6: 


B<a 


he 


■■g<^g 


he: 


\d<^g 


he: 


T^^g 



hx: 


■■g<g 


he 


■.T<g 


h{e}: 


iz:F<\g 


he: 


j'i<g 



he: 


■■g<ig 


h fix.l 


z:g<\g 


he: 


T<g 


ha(e) 


■.T^<\g 


he: 


3^2 <\g 



hl{e):TixT2<ig h r(e) : J-i X J-2 < a h l[e]: Ti ^ T2 <ig h r[e] : J^i ^ 7^2 < 5 

Intuitively, h e: J- <\ G (for a closed expression e) means that e denotes an element 
of J-{0,g), where fig is the final coalgebra of G- As expected, there is a rule for each 
expression construct. The extra rule involving Id <l ^ reflects the isomorphism between 
the final coalgebra 0,g and G{^g) (Lambek's lemma, cf. [31j)- Only fixed points at the 
outermost level of the functor are allowed. This does not mean however that we disallow 
nested fixed points. For instance, /ire. a{x ® ^y. a{y)) would be a well-typed expression for 
the functor D of deterministic automata, as it will become clear below, when we will present 
more examples of well-typed and non-well-typed expressions. The presented type system 
is decidable (expressions are of finite length and the system is inductive on the structure 
of e G Exp). Note that the rules above are meant to be read as an inductive definition 
rather than as an algorithm. In an eventual implementation, extra care is needed in the 
case ^ = Id, to avoid looping in the rule for Id < ^. 

We can formally define the set of ^-expressions: (closed and guarded) well-typed ex- 
pressions associated with a non-deterministic functor G- 

Definition 3.3 (^-expressions). Let ^ be a non-deterministic functor and T an ingredient 
of G- We define Expjr^g by: 

Expjr^g = {e e Exp'' I h e: T <iG}. 

We define the set Expg of well-typed G-expressions by Expg^g. Jjt 
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Let us instantiate the definition of ^-expressions to the functors of deterministic au- 
tomata P = 2 X Id^. 

Example 3.4 (Deterministic expressions). Let ^ be a finite set of input actions and let X 
be a set of (recursion or) fixed point variables. The set Exp25 of deterministic expressions is 
given by the set of closed and guarded (each variable occurs in the scope of a(— )) expressions 
generated by the following BNF grammar. For a G A and x ^ X: 



Exp25 9 e 

£2 



e © e I ^x.e \ x \ l{ei) \ r{e2) 

I 1 I £1 © £1 

a{e) I £2 © £2 



Examples of well-typed expressions for the functor P = 2 x Id (with 2 = {0, 1} a 
two-element join-semilattice with as bottom element; recall that the ingredients of D are 
2, Id^ and T> itself) include r{a{9)), /(I) ©r(a(?(0))) and fix.r{a{x)) ®1{1). The expressions 
/[I], /(!)©! and fj,x.l are examples of non well-typed expressions for V, because the functor 
D does not involve ^, the subexpressions in the sum have different type, and recursion is 
not at the outermost level (1 has type 2 <\T>), respectively. 

It is easy to see that the closed (and guarded) expressions generated by the grammar 
presented above are exactly the elements of Exp^. The most interesting case to check is 
the expression r{a{e)). Note that a(e) has type Id < 2? as long as e has type Id <l V. And 
the crucial remark here is that, by definition of h, Exp^^g C Expg. Therefore, e has type 
Id oP if it is of type T><iT>, or more precisely, if e G Exp-p, which explains why the grammar 
above is correct. 

At this point, we should remark that the syntax of our expressions differs from the 
classical regular expressions in the use of fj, and action prefixing a(e) instead of star and 
full concatenation. We shall prove later that these two syntactically different formalisms 
are equally expressive (Theorems 13.121 and I3.14p . but, to increase the intuition behind our 
expressions, let us present the syntactic translation from classical regular expressions to 
Exp-p (this translation is inspired by [28j) and back. 

Definition 3.5. The set of regular expressions is given by the following syntax 

RE B r:: = \ 1 \ a \ r + r \ r ■ r \ r* 

where a £ A and • denotes sequential composition. We define the following translations 
between regular expressions and deterministic expressions: 



(-)t: RE^ Expi, 


{-)t:Expj,^RE 


(0)t =0 


(0)t = 


(1)^ = m 


(/(0))t = (l{0))t = (r(0))t = 


(a)t =r{a{l{l))) 


(^(1»* =1 


(ri+r2)t =(ri)t©(r2)t 


{l{ei(Be2))^ ={l{e,))t + {l{e,))t 


(ri-r2)t = (ri)t[(r2)t//(l)] 


(r(a(e)))t = a ■ {e)i 


(r*)t =fix.{r)^x/l{l)](Bl{l) 


(r(ei©£2))* =(r(ei))^ + (r(62))^ 




(£i©e2)* ={ei)^ + {e2)^ 




(fix.e)^ = sol(eqs(//x.e)) 



The function eqs translates fix.e into a system of equations in the following way. Let 
/Ltxi.ffi, . . . ,nxn-£n be all the fixed point subexpressions oi fix.s, with xi = x and Si = e. We 
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define n equations Xi = (ej)^, wliere Si is obtained from ej by replacing each subexpression 
fiXi.Ei by Xi, for alH = 1, . . . n. The solution of the system, sol(eqs(^x.e)), is then computed 
in the usual way (the solution of an equation of shape x = rx + 1 is r*t). 

In [32], regular expressions were given a coalgebraic structure, using Brzozowski deriva- 
tives [13]. Later in this paper, we will provide a coalgebra structure to Exp-p, after which 
the soundness of the above translations can be stated and proved: r '--' r^ and e ~ e^, where 
~ will coincide with language equivalence. Jft 

Thus, the regular expression aa* is translated to r{a{iix.r{a{x)) ® ^(1))), whereas the 
expression fj.x.r{a{r{a{x)))) © /(I) is transformed into (aa)*. 

We present next the syntax for the expressions in Expg and in Exp^ (recall that Q = 
(l^ld)^ andAA = 2x (^gld)^). 

Example 3.6 (Partial expressions). Let A be a finite set of input actions and X be a set 
of (recursion or) fixed point variables. The set Expg of partial expressions is given by the 
set of closed and guarded expressions generated by the following BNF grammar. For a £ A 
and X G X: 

Expg 3 e :: = 0|e©e| fj,x.e \ x \ a{ei) 

£i :: = I ei©ei I l[£2] \ r[e] 

£2 "= I £2 © £2 I * 

Intuitively, the expressions a(/[*]) and a(r[e]) specify, respectively, a state which has no 
defined transition for input a and a state with an outgoing transition to another one specified 
by e. ^ 

Example 3.7 (Non-deterministic expressions). Let ^ be a finite set of input actions and 
X be a set of (recursion or) fixed point variables. The set Expj^ of non- deterministic 
expressions is given by the set of closed and guarded expressions generated by the following 
BNF grammar. For a £ A and x £ X: 

Exp_^ Be :: = I X I r{e2) \ l{ei) | e © e | fj,x.e 

£1 :: = I ei©ei I 1 I 

: I £2 © £2 I a{e') 

: I £' © £' I {£} 

Intuitively, the expression r{a{{ei} © {£2})) specifies a state which has two outgoing tran- 
sitions labelled with the input letter a, one to a state specified by £1 and another to a state 
specified by £2. <|k 

We have defined a language of expressions which gives us an algebraic description of 
systems. We should also remark at this point that in the examples we strictly follow the 
type system to derive the syntax of the expressions. However, it is obvious that many 
simplifications can be made in order to obtain a more polished language. In particular, 
after the axiomatization we will be able to decrease the number of levels in the above 
grammars, since will we have axioms of the shape a(£) © a{e') = a(e © e'). In Section [5l we 
will sketch two examples where we apply some simplification to the syntax. 

The goal is now to present a generalization of Kleene's theorem for non-deterministic 
coalgebras (Theorems 13.121 and 13.14] ). Recall that, for regular languages, the theorem states 
that a language is regular if and only if it is recognized by a finite automaton. In order to 
achieve our goal we will first show that the set Expg of ^/-expressions carries a ^-coalgebra 
structure. 



£2 
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3.1. Expressions are coalgebras. In this section, we show that the set of ^-expressions 
for a given non-deterministic functor Q has a coalgebraic structure 6g: Expg — )■ ^(Expg) . 
More precisely, we are going to define a function 

ST<g ■■ Expj-^jg; -^ -F(Expg;) 

for every ingredient J-" of ^, and then set 6g = 5g<ig. Our definition of the function 5j^<ig 
will make use of the following. 

Definition 3.8. For every Q £ NDF and for every J^ with J^ <\Q: 

(i) we define a constant Empty jr<ig G J^(Expg) by induction on the syntactic structure 
of J": 

Emptyid<g = Empty_^^^^^^g = ± 

EmptYB^g = -Lb Empty^A^g = Aa.Emptyj-^g. 

Empty^^x.F24g = (Empty ^^^g, Empty ^^^g) Empty^^^g = 

(ii) we define a function Plusj-^jg: J^(Expg) x J^(Expg) — ;■ J^(Expg) by induction on the 
syntactic structure of T: 

Plusid<]g(ei,e2) = ei © ^2 

PluSB<]g(6l,62) = ^lVBfo2 

PluS^jXj-2<£;((ei,e2),(e3,e4)) = (PluSjr^4g(ei,e3),PluSjr2<jg(e2,e4)) 

P'us^i^j-2<£;('«i(ei))'^i(<^2)) = Ki{P\usjr^<ig{ei,e2)), i e {1,2} 
P\uSjr^^jr^^g{Ki{ei),Kj{e2)) = T i,jG{l,2} and i j^ j 
PI^JS^i^J-2<e(^'T) = Plus^i^j-2<e(T,a:) = T 

PI^JS^i^J-2<6(^'^) = Plus^i^j-2<e(^'^) = ^ 

P\uSjrA^glf,g) = Xa. P\usjr^g{f{a),g{a)) 

PluS^jr<|g(si,S2) = Si U S2 

Intuitively, one can think of the constant Empty jr^jg and the function Plusj-^g as liftings of 
and © to the level of -F(Expg). J(> 

We need two more things to define 5jr^g. First, we define an order ^ on the types of 
expressions. For J^i, T2 and Q non-deterministic functors such that T\<\Q and T2 < Qi we 
define 

The order ^ is a partial order (structure inherited from <). Note also that (J^i <\ Q) = 
{^2 <^Q) -^ J'l = J'2- Second, we define a measure N{£) based on the maximum number of 
nested unguarded occurrences of ^u-expressions in e and unguarded occurrences of ©. We 
say that a subexpression iix.ei of e occurs unguarded if it is not in the scope of one of the 
operators /( — ), r( — ), /[— ], r[— ], a(— ) or {— }. 

Definition 3.9. For every guarded expression e, we define N{e) as follows: 

Ar(0) = N{h) = N{a{e)) = N{l{e)) = N{r{e)) = N{l[e]) = N{r[e]) = N{{e}) = 

N{ei © £2) = 1 + max{N{ei), N{e2)} 

N{fix.e) = 1 + N{e) 

The measure N induces a partial order on the set of expressions: ei ^ £2 "^ -^(^1) — -^(^2)) 
where < is just the ordinary inequality of natural numbers. 

Now we have all we need to define Sj^^g : Expjr<ig — )■ -F(Expg). 
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Definition 3.10. For every ingredient J- oi a non-deterministic functor Q and an expression 
e G Expjr^g, we define Sjr^g{e) as follows: 

(^^^G® = Emptyj-^g; 

ST<g{£l®!^2) = PluSjF<]g;((5jr<|g(ei),(5jr<£;(e2)) 
5g<ig{^X.e) = 5g^g{e[fix.e/x]) 

6\d<ig{e) = e for ^ / Id 

STixT2<gir{£)) = {Emptyjr^^g,5jr^4g{£)) 

5 ( (p)) = A ' / ^J^<gi^) ifa = a' 

■^ ^^ 'I Emptyjr^jg otherwise 

Here, e[fix.e/x] denotes syntactic substitution, replacing every free occurrence of x in e by 
fix.e. Jit 

In order to see that the definition of Sj^^jg is well- formed, we have to observe that Sj^^jg 
can be seen as a function having two arguments: the type J- < G and the expression e. 
Then, we use induction on the Cartesian product of types and expressions with orders :< 
and <^, respectively. More precisely, given two pairs {Ti <\Q,ei) and {7^2 O ^1^2) we have 
an order 

{J'i<G,ei)<{T2<iG,e2) ^ (i) (-Fi < a) ^ (-^2 < a) , . 

or (ii) (^1 < g) = (J-2 < g) and ei < £2 ^ ' 

Observe that in the definition above it is always true that {J^' <\ Q,e') < {J- <\g,£), for all 
occurrences of 6T'<ig{^') occurring in the right hand side of the equation defining 5jr^g{e). 
In all cases, but the ones that e is a fixed point or a sum expression, the inequality comes 
from point (i) above. For the case of the sum, note that {J^ <\g,ei) < {T <\ G , £1 ® £2) and 
{F<\Q,e2) < {F<G-,£i®£2) by point (ii), since iV(ei) <iV (£1062 ) and A^(e2) <^^(el©e2)■ 
Similarly, in the case of //x.e we have that A^(e) = iV(e[//x.e/x]), which can easily be proved 
by (standard) induction on the syntactic structure of e, since e is guarded (in x), and this 
guarantees that N{e[^x.e/x\) <N{^x.e). Hence, {Q <G-,s) < {Q <\G,fJ.x.e). Also note that 
clause 4 of the above definition overlaps with clauses 1 and 2 (by taking J^ = Id). However, 
they give the same result and thus the function 5j^<ig is well-defined. 

Definition 3.11. We define, for each non-deterministic functor G, a ^-coalgebra 

5g: Expg ^g(Expg) 
by putting 5g = (Jg^g. Jft 

The function 5g can be thought of as the generalization of the well-known notion of 
Brzozowski derivative [13] for regular expressions and, moreover, it provides an operational 
semantics for expressions, as we shall see in Section [3.21 

The observation that the set of expressions has a coalgebra structure will be crucial for 
the proof of the generalized Kleene theorem, as will be shown in the next two sections. 
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3.2. Expressions are expressive. Having a ^-coalgebra structure on Expg has two ad- 
vantages. First, it provides us, by finality, directly with a natural semantics because of the 
existence of a (unique) honiomorphism beh: Expg — )• Qg, that assigns to every expression 
e an element beh(e) of the final coalgebra Qg. 

The second advantage of the coalgebra structure on Expg is that it lets us use the 
notion of ^-bisimulation to relate ^-coalgebras {S,g) and expressions e £ Expg. If one can 
construct a bisimulation relation between an expression e and a state s of a given coalgebra, 
then the behaviour represented by e is equal to the behaviour of the state s. This is the 
analogue of computing the language L{r) represented by a given regular expression r and 
the language L{s) accepted by a state s of a finite state automaton and checking whether 
L{r) = Lis). 

The following theorem states that every state in a locally finite ^-coalgebra can be 
represented by an expression in our language. This generalizes half of Kleene's theorem for 
deterministic automata: if a language is accepted by a finite automaton then it is regular 
{i.e. it can be denoted by a regular expression). The generalization of the other half oi the 
theorem (if a language is regular then it is accepted by a finite automaton) will be presented 
in Section 13.31 It is worth to remark that in the usual definition of deterministic automaton 
the initial state of the automaton is included and, thus, in the original Kleene's theorem, 
it was enough to consider finite automata. In the coalgebraic approach, the initial state is 
not explicitly modelled and thus we need to consider locally-finite coalgebras: coalgebras 
where each state will generate a finite subcoalgebra. 

Theorem 3.12. Let Q be a non- deterministic functor and let {S,g) be a locally-finite Q- 
coalgebra. Then, for any s £ S, there exists an expression (( s )) E Expg such that s ~ {{s)). 

Proof. Let s £ S and let (s) = {si,...,s„} with si = s. We construct, for every state 
Si £ (s), an expression (( Si )) such that Si ~ (( Si )) . 

If ^ = Id, we set, for every i, (( Sj )) = 0. It is easy to see that {(si,0) | Si £ (s)} is a 
bisimulation and, thus, we have that s ~ {{s)). 

For Q 7^ Id, we proceed in the following way. Let, for every i, Ai = uxj.7 / -, where, for 

T <\Q and c £ T{s), the expression j^ £ Exp jr^g is defined by induction on the structure 
of J": 



ic 



eiTf} c^ 



cec 

otherwise 



Note that here the choice of /[0] ©r[0] to represent inconsistency is arbitrary but canonical, 
in the sense that any other expression involving sum of l[ei] and r[e2] will be bisimilar. 
Formally, the definition of 7 above is parametrized by a function from {si, . . . , Sn} to a fixed 
set of variables {xi, . . . ,x„}. It should also be noted that £i stands for ei®(e2©(£3©- • •)) 

(this is a choice, since later we will axiomatize to be commutative and associative). 

Let A^ = Ai, define A^+^ = A'y{A''^^^/xk+i} and then set (( s, )) = Af. Here, A{A'/x} 
denotes syntactic replacement (that is, substitution without renaming of bound variables 
in A which are also free variables in A'). The definition of (( Sj )) does not depend in the 
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chosen order of {si, . . . , Sn}: the expressions obtained are just different modulo renaming 
of variables. 

Observe that the term 

A^ = (mx,.7,%)){^?/xi} . . . {^r 7x4 

is a closed term because, for every j = 1, . . . , n, the term A-'-~ contains at most n — j free 
variables in the set {xj+i, . . . , Xn}- 

It remains to prove that Sj ~ {{si))- We show that R = {{si,{{si))) \ Sj G (s)} is a 
bisimulation. For that, we define, for J^ < ^ and c G J^{s), S,^ = lc{-^\/^i\ ■ ■ ■ {-^nT^^ / ^n} 
and the relation 

RT<g = {{c,8r<g{ic)) I c G -F(s)}. 
Then, we prove that ® -Rj-<ig = J^{R) and (2) (^(sj), (5g((( Sj )))) G i?g<g. 
(l) By induction on the structure of F. 

-F = ld| Note that Rm^g = {(sj,^!^) | si G (s)} which is equal to Id(-R) = R 



t|d<g 



provided that ^|, = (( si )). The latter is indeed the case: 

it = 7l1K/xi}...{^rV^n} (def. ei^) 

= x,K/xi}...{^rVx„} (def. 7i^) 

= A0{A?/xi}...{^rVx4 (def. 4-7 

= ((^0) (def. ((..))) 



F = B| Note that, for 6 G B, ^f = -if{A\/xi] . . . {A'^-^lxn} = b. Thus, we have 
that Rs^g = {{s^,^l) I s. G B(s)} = {(6, 6) | 6 G B} = B(i?). 



T = T\ X J'o 



(ii,t>), (ej)) G -Fi X F2{R)_ 
u,e) G J"i(i?) and {v,f) G J'2(i?) 



(def. -Fi X J2) 

u,e) G -Rj-i<g and {v,f) G -Rj:-2<iS (i^d. hyp.) 

u,e) = {c,5:Fi<g{ic^)) and (w,/) = (c', 5j-2<6;(Cj')) (def. i?j-,<]6;) 

u, v) = (c, c') and (e, /) = <5j-,xj-2<e(/(ef7 ® K4^')) (def. <5^^g) 

^,t;) = {c,c') and (e, /) = 5^,x.F2<e(<:;)-^') (^^f" ^^) 

{u,v),{e,f)) G RTixT2<g 

T = -Ff^ and .F = ^-Fi : similar to T\ x ^2- 



J- = J--I ^ JC-. 



We want to prove that {gisi),5g{{{si)))) G Rg<ig- For that, we must show that 
g{si) G ^(s) and 6g{{{si))) = (^glw^.-))- The former follows by definition of (s), 
whereas for the latter we observe that: 






<5e((Mx..7L,){A0/xi} . . . {Ar Vx„}) 



(def. of((s,)» 
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= 6g{^%^^{A'i/x,}...{A. 



S^i^ 



^/x,_i}{A^+i/x,+i} . . . {ArVa:„}[4"/^.]) (def. of 5g) 



i-2 



^M_i}{ArM}{A^+i/x.+i} . . . {Ar Vx„}) 



Here, note that [A"/xj] = {^"/xj}, because A^ has no free variables. The last 
two steps follow, respectively, because Xi is not free in j4* , i , . . . , A^~^ and: 



^i+l' • • • ' ^n 



{A 



i-l 



/x^]{A\^l/xi+l]...{Al Vx„} 



Equation ()3.2p uses the syntactic identity 

A{B{C/y]/x]{C/y] = A{B/x]{C/y], y not free in C 



(3.2) 

(3.3) 
D 



Let us illustrate the construction appearing in the proof of Theorem 13.121 by some 
examples. These examples will illustrate the similarity with the proof of Kleene's Theorem 
presented in most textbooks, where a regular expression denoting the language recognized 
by a state of a deterministic automaton is built using a system of equations. 

Consider the following deterministic automaton over A = {a, 6}, whose transition func- 
tion g is given by the following picture ((TsJ) represents that the state s is final): 



b a,b 

We define Ai = //xi.7^^^^ and A2 = /"a:2-7^s2) ^^^^^ 

7j(.,) = ^(0) e r(6(xi) e a(x2)) tJJ.^) = ^(1) © r{a{x2) © b{x2)) 

We have Aj = Ai{Al/x2} and Aj = A2{Ayxi}. Thus, ((52)) = ^2 and, since A^ = A2, 
{{ si )) is the expression 

fixi. l{0) e r{b{xi) e a(/iX2. /(I) © r(a(x2) © b{x2)))) 

By construction we have si ~ (( si )) and S2 ~ (( S2 )). 

For another example, take the following partial automaton, also over a two letter al- 
phabet A = {a, b}: 

b 

91 




Q2 



In the graphical representation of a partial automaton {S,p) we omit transitions for which 
p{s){a) = Ki(*). In this case, this happens in qi for the input letter b and in (72 for a. 
We will have the equations 

A^ = A°^=Al = nxi.b{l[*]) © a(r[x2]) 

A2 = A^ = Al = nx2.a{l[*]) © 6(r[x2]) 
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Thus: 

(( si ))=Al = fixi. b{l[*]) e a{r[fiX2. a{l[*]) b{r[x2])]) 

((sa)) = iiX2.a{^[*]) ® b{r[x2]) 

Again we have si ~ (( si )) and S2 ~ (( S2 ))• 

As a last example, let us consider the following non-deterministic automaton, over a 

one letter alphabet A = {a}: 




We start with the equations: 

Ai = /xxi./(0) er(a({xi}e{x2}e{2;3})) 
A2 = //X2./(0)er(a({x2}e{x3})) 
^3 = fiX3.l{l)er{a{{xi}®{xs})) 

Then we have the following iterations: 

Al=Ai 

A\ = Ai{Al/x2} = fixi.l{0) e r(a({xi} {^2} {X3})) 



Af = Ai{Al/x2}{Al/x3} = fixi.m © r(a({xi} {{A2{Al/x3})} {^1})) 



AI 



41 



A2{Ai/xi} = A2 
A2{Ai/xi} = A2 
A2{Ai/xi}{Al/xs} 



fiX2.l{0) 0r(a({x2}0{^i})) 



AsiAi/xi} = ^X3./(l) r{a{{Ai} {X3})) 
As{Ai/xi}{Al/x2} = ^X3i(l) r{ai{{Ai{Al/x2})} {X3})) 



42 
^3 



This yields the following expressions: 

((si))=/ixi./(0)®r(a({a:i}®{((s2))}®{((s3))})) 

((s2))=MX2.Z(0)®r(a({x2}©{((s3))})) 

(( S3 )) = iiX3.l{l) ® r(a({^cci J(0) ® r{a{{xi} ® {Aia;2./(0) ® r(a({a;2} ® {xs}))} ® {xs}))} ® {0:3})) 



3.3. Finite systems for expressions. Next, we prove the converse of Theorem 13.121 
that is, we show how to construct a finite ^-coalgebra {S, g) from an arbitrary expression 
e G Expg, such that there exists a state s G 5 with e ~g s. 

The immediate way of obtaining a coalgebra from an expression e G Expg is to com- 
pute the subcoalgebra (e), since we have provided the set Expg with a coalgebra structure 
5g: Expg — ;■ Q{Expg). However, the subcoalgebra generated by an expression e G Expg by 
repeatedly applying 5g is, in general, infinite. Take for instance the deterministic expression 
ei = jix. r{a{x fiy. r{a{y)))) (for simplicity, we consider A = {a} and below we will write, 
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in the second component of 5t>, an expression e instead of the function mapping a to e) and 
observe that: 

5v{ei) = {0,£i ® iJ.y.r{a{y))) 

5v{ei® iiy.r{a{y))) = {0,£i ® f^y.r{a{y)) ® ny.r{a{y))) 

Sv{£i © My- r{a{y)) © fiy. r{a{y))) = (0, si © fiy. r{a{y)) © /iy. r{a{y)) © fiy. r{a{y))) 



As one would expect, ah the new states are equivalent and will be identified by beh (the 
morphism into the final coalgebra). However, the function 6x> does not make any state 
identification and thus yields an infinite coalgebra. 

This phenomenon occurs also in classical regular expressions. It was shown in [13] 
that normalizing the expressions using the axioms for associativity, commutativity and 
idempotency was enough to guarantee finitenesqj. We will show in this section that this 
also holds in our setting. 

Consider the following axioms (only the first three are essential, but we include the 
fourth to obtain smaller coalgebras): 

(Associativity) ei © (^2 © £3) = (ei © £2) © £3 

(Commutativity) ei © £2 = £2 © £1 

(Idem,potency) e © e = e 

(Empty) © e = e 

We define the relation =acie'^ Expjr^^g x Exp^-^g, written infix, as the least equivalence 

relation containing the four identities above. The relation =acie gives rise to the (surjective) 

equivalence map [£]/ic'/B = W \ ^ =ACIE £'}• The following diagram shows the maps defined 

so far: 

,- 1-]acie ,- , 

Exp^^g > Exp_^^g/=^^,^ 

^(Expg) — — -^ J-(Expg/^^^,,^) 

>(|-Uc/b) 

In order to complete the diagram, we next prove that =acie is contained in the kernel of 

J^([-]acie) o St<S- 

This will guarantee the existence of a function 

Sjr^g: Expjr^g/=^^jj^ ^ J"(Expg/=^^^^) 

which, when F = Q, provides Expg/= with a coalgebraic structure 

5g: Expg/=^p,^ -^ g(Expg/=^p,^) 
(as before we write 5g for 5g<g) and which makes [— ]yicffi a homomorphism of coalgebras. 



Actually, to guarantee finiteness, similar to classical regular expressions, it is enough to eliminate double 
occurrences of expressions e at the outermost level of an expression •••©£©•••©£©••• (and to do this 
one needs the ACT axioms). Note that this is weaker than taking expressions modulo the ACT axioms: for 
instance, the expressions ei © £2 and £2 © £1, for £1 7^ £2, would not be identified in the process above. 

This is equivalent to prove that Expjr^g/=^c/£' together with [—\acie, is the coequalizer of the projection 
morphisms from ^acie to Expjr^g. 
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Lemma 3.13. Let Q and F he non- deterministic functors, with T <\Q. For all ei,e2 £ 

£1 =ACIE £2 => {F{[-]ACIE)){^T<g{ei)) = {F{[-]ACIE)){ST<g{£2)) 

Proof. In order to improve readability, in this proof we will use [— ] to denote [—]acie- 
It is enough to prove that for all xi, X2, X3 G -F(Expg): 

® .F([-])(PluSj-<3g;(PluSj-<]g;(xi,a;2),a;3)) = J='{[-]){P\uSjr^g{xi,P\uSjr<tg{x2,X3))) 

(2) ^(H)(Plus^<g(xi,X2)) = -F(H)(Plus^<e(a^2,xi)) 

(D ^(H)(Plus^<g(xi,xi)) =^(H)(^i) 

^(H)(Plus^<g(Empty^^g,xi)) = J-([-])(xi) 

By induction on the structure of J-". We illustrate a few cases, the omitted ones are proved 
in a similar way. 
-^ = Id I xi,a:2,X3 G Expg 



® [PluS|d<6;(PluS|d4g(xi,X2),X3)] 

= [(xi 0x2)0x3] 

= [xi e (x2 e X3)] 

= [PluS|d<e(xi,PluS|d<ig(x2,X3))] 

[PluS|d<g;(Emptyid^g;,xi)] 

= [0exi] 

= [^1] 



(def. Plus) 
(Associativity) 
(def. Plus) 

(def. Plus and Empty) 
(Empty) 



J" = -Fi X Jo 



xi = (ni,t;i),X2 = {u2,V2) G {Fi x J"2)(Expg) 

(-Fl X F2)i[-])iP\u5^ixT2<gi{ui,Vl), {U2,V2))) 

{Ti{[-]){P\usT,^g{ui,U2)),M[-W\^s^2<givi,V2))) (def. Plus) 
(•^i([-])(Plusjri<]g;(n2,ui)),J"2([-])(Plusjr2^g(t)2,t;i))) (ind. hyp.) 

{J='l X J='2){[-]){P\uSjr,y:jr.^^g{{u2,V2),{ui,Vi))) (def. Plus) 

(J"l X J='2){[-]){P\uSjr^y,jr^^g{{ui,Vl), {ui,Vl))) 

{Ti{[-]){P\us^,^g{m,ui)),T2{[-]){P\us^,<,g{vi,vi))) (def. Plus) 
(Ji([-])(^i),J-2([-])(^i)) (ind. hyp.) 

iTiXT2)i[-mUl,Vi)) 



T 



T^T\ | xi,X2,X3 G :S-Fi(Expg;) 

® :S-Fi([-])(PluS^j-,<3g;(xi,PluS^j-,<g(x2,X3))) 

= ^-Fi([-])(xi U (X2 U X3)) (def. Plus) 

= ^-Fi(H)((xiUX2)UX3) 

= :S-Fi([-])(Plus^j-,<]g(Plus^j-^<3g;(xi,X2),X3)) (def. Plus) 

In the last but one step, we use the fact that, for any set X, {J^{X\ U, 0) is a join-semilattice 
(hence, xi U (x2 U X3) = (xi U X2) U X3). Due to this fact, in the case T = T^J-\, in this 
particular proof, the induction hypothesis will not be used. □ 
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Thus, we have a well-defined function 

such that '6r^g{[e\AciE) = {^[-]AciE){5F<ig{£))- 

We are ready to state and prove the second half of Kleene's theorem. 

Theorem 3.14. Let Q he a non- deterministic functor. For every e G Expg, there exists 
Ag(e) = {S,g) such that S is finite and there exists s £ S with e ~ s. 

Proof. For every e £ Expg, we set Ag(e) = {[e]AciE) (recall that (s) denotes the smallest 
subcoalgebra generated by s). First note that, by Lemma I3.13| the map [—]acie is a 
homomorphism and thus e ~ [eJACiE- We prove, for every e G Expg, that the subcoalgebra 
(Macie) = (^T^g) has a finite state space V (here, 6g actually stands for the restriction of 
6g to V). Again, in order to improve readability, below we will use [— ] to denote [— ]^cffi- 
More precisely, we prove, for all e E Expjr^^g, the following inclusion 

V ^V = {[eie...eefc] I ei,...,efc G d{e) all distinct,ei, . . . ,6^ G Expg} (3.4) 

Here, if A; = we take the sum above to be and cl{e) denotes the smallest set containing 
all subformulas of e and the unfoldings of /x (sub)formulas, that is, the smallest subset 
satisfying: 

d(0) = {0} d{l[ei]) = {l[ei]} U cl{ei) 

d(eiee2) = {ei ® 62} U cl{ei) U cl{e2) c/(r[ei]) = {r[ei]} U cl{ei) 

cl{fix.£i) = {fix.Ei} U cl{ei[nx.ei/x]) c/(a(ei)) = {a{ei)} U cl{ei) 

diliei)) = {/(ei)}Ud(ei) d({ei}) = {{ei}} U d(ei) 

cl{r{ei)) = {r{ei)}Ucl{ei) 

Note that the set cl{e) is finite (the number of different unfoldings is finite) and has the 
property e G cl{e). 

We prove the inclusion in equation (13. 4p in the following way. First, we observe that 
[e] G V, because e £ cl{e). Then, we prove that {V,6g) (again, 6g actually stands for the 
restriction of dg to V) is a subcoalgebra of {Expg,5g). Thus, V '^ V, since V, the state 
space of ([e]) is equal to the intersection of all subcoalgebras of {Expg,Sg) containing [e]. 

To prove that {V, 6g) is a subcoalgebra we prove that, for ei, . . . , ejt G Expjr^g, 

ei,...,£k £ cl{e) all distinct ^ ^jr<]g;([ei . . . e^]) G J"(F) (3.5) 

The intended result then follows by taking T = Q. 

We first prove two auxiliary results, by induction on the structure of J-: 

® (J-H)(Empty^^g)GJ-(F) 

(D (J-[-])(Plus^^g(n,7;)) G T{V) ^ {F[-]){u) £ F{V) and {F[-]){v) £ HV) 

for u,v £ J^(Expg). 



J-=ld 



® (^H)(Empty^^g) = 1 G y 

(2) {J='[-]){P\usjr^g{u,v)) = [u®v]£V 4^[u]£V and[v] £V u,v £ Expg 

The right to left implication follows because, using the (Associativity), (Commutativity) 
and (Idempotency) axioms, we can rewrite u © f as ei ® . . . © e^, with all ei, . . . , e^ G c/(e) 
distinct. 
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T = B 



® (BH)(EmptyB^e) = ±BGB(F) 

(2) (B[-])(PlusB<3g(u,w)) = u\Jv(^ B(F) ^ud B(F) anduG B(F) u,w G B(Expg) = B 



T = T\ X To 



® (-Fix J-2H)(Empty^^x^^^c7) _ 

= ((J-iH)(Empty^,^e), (J-2H)(Empty^,^e)) G -Fi x J-2(F) 

(2) (-Fi X J'2[-])(Plusj-,xj-2<e;(('Ui,'U2),(i'i,w2») = _ 

((-FiH)(Plus^i<gK,^^i)),(-^2[-])(Plus^,<e(n2,t;2))) G -Fi x ^2(1^) 

<;=^ Mi,wi G F"i(y) and n2,t;2 G -^2(1/) 
<^ {u,v) G J"i X J'2(y), u = (^1,^2),^ = (^1,^2) G -Fi X J"2(Expg) 



J- = J-l ^ J-2 



and 



J^ = -Fi^ 



: similar to J-"i x J^2- 



^ = V^^i 



® (^^H)(Empty^^^g)=0G^^(y) 

© (^^[-])(PlusB^<,g(u,^)) = (m^[-])(n) U (^-FH)(«)) G ^-F(F) 
^ (S-F[-](n)) G ^-F(l^) and (S-F[-](t;)) G :S-F(y) 

Using (2), we can simplify our proof goal (equation (j3.5p ) as follows: 

'^T<ig{[ei e . . . e Ek]) G F(y) ^ {H-]){6^^g{ei)) G T{V), Si G d(e), i = 1, . . . , A; 

Next, using induction on the product of types of expressions and expressions (using the order 
in equation ()3.ip ). (T) and (2), we prove that (-F[— ])((5jr^g(ej)) G -F(F), for any £« G c/(e). 

iH-m^Togm) = (-F[-])(Empty^^g) G T(V) (by ®) 

(-F[-])(^^4g(ei e £2)) = {H-]){P\us^<,g{6^^g{ei),6T<g{e2)) G -F(F) (777 and (2) 

(a[-])(<5g^g(A.x.e)) = (a[-])(<5g<g(e[/xx.e/x])) G ^(F) 

(ld[-])(5id<g(e^)) = [e^] G ld(F) for ^ / Id 

(BH)'^B^g(&) = &GB(F) 

{J^i X J'2[-]){6T,xT,<g{l{e))) 
= {{J^,[-]){6^^^g{e)), (J-2H)(Empty^^^e)) ^ -Fi x T2iV) 

(-FixJ-2H)(57-,xj-,<g(r(e))) 

= ((-FiH)(Empty^^^g),(-F2H)('^^2<e(£))) G -^1 x -F2(F) 
(-Fi ^ J-2H)('5^,^j-,<e(ne])) = '^i((-^iH)(^7-i<g(e))) G -Fi ^ J-2(F) 
(-Fi ♦ J-2[-])(<5^^^^^^e(^N)) = ^2((-F2[-])(<5^,<g(e))) G -Fi ^ J-2(F) 

7A1 ^^fx . t^f^\\\ - I \.J ) (H-]){^T<gi.^)) ifa = a' 

Emptyjr^g otherwise 



i^^[-]){S^A^gia{e))) = iXa' 



(IH) 

{si G cl{e)) 
(B(F) = B) 

{IH and ®) 

{IH and ®) 

(ffl) 

(ffl) 

G T'^{V) {IH and ®) 



m^[-mS^T<g{{e})) = { {n-mST<g{e)) } G ^-F(F) 



(777) 



D 
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3.3.1. Examples, n this subsection we will illustrate the construction described in the proof 
of Theorem 13.141 given an expression e G Expg we construct a ^-coalgebra (5, g) such 
that there is s € S with s ~ e. For simplicity, we will consider deterministic and partial 
automata expressions over A = {a, b}. 

Let us start by showing the synthesised automata for the most simple deterministic 
expressions - 0, l{0) and 1{1). 



a,b 





The first two automata recognize the empty language and the last the language {e} 
containing only the empty word. 

We note that the generated automata are not minimal (for instance, the automata 
for l{0) and are bisimilar). Our goal has been to generate a finite automaton from 
an expression. From this the minimal automaton can always be obtained by identifying 
bisimilar states. 

The following automaton, generated from the expression r{a{l{l))), recognizes the lan- 
guage {a}, 

■ ' ' 5(1) 




For an example of an expression containing fixed points, consider e = ^x. r{a{l{0)(Bl{l)(Bx)). 
One can easily compute the synthesised automaton: 



fix. r{a{l{0) e /(I) e x)) l^^\l{0) (B l{l) (B e 




a,b 



and observe that it recognizes the language aa* . Here, the role of the join-semilattice 
structure is also visible: /(O) © /(I) © e specifies that this state is supposed to be non-final 
(/(O)) and final (/(I)). The conflict of these two specifications is solved, when they are 
combined with ©, using the semilattice: because 1 V = 1 the state is set to be final. 

As a last example of deterministic expressions consider ei = fj,x.r{a{x (B fj-y.r{a{y)))). 
Applying 6ti to ei one gets the following (partial) automaton: 



fix. r{a{x © /iy. r{a{y)))) V-^ ei © f^y. r{a{y)) 



Calculating 5t){si © Aiy. r{a{y))) yields 
dv{ei®fiy.r{a{y))) = 



(0,t) 



where t{a) = ei © fiy. r{a{y)) © /xy. r{a{y))) 
t{b) = 
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Note that the expression ei © ^y.r{a{y)) © ^y.r{a{y)) is in the same equivalence class 
as El © fiy . r {a{y)) , which is a state that already exists. As we saw in the beginning of 
Section [STTl by only applying 6x>, without ACI, one would always generate syntactically 
different states which instead of the automaton computed now: 



^ n r 



fix. r{a{x © fiy. r{a{y)))) > ei © fiy. r{a{y)) 4 




a,b 



would yield the following infinite automaton (with 82 = fiy.r{a{y))): 



fix. r{a{x © fiy. r{a{y)))) ] ° > [£i©£2 ) ° > [ £i©£2©g2] " > 




Let us next see a few examples of synthesis for partial automata expressions, where we 
will illustrate the role of _L and T. In the graphical representation of a partial automa- 
ton {S,p), we will omit transitions for inputs a with g{s){a) = Ki{*) and we will draw 

— -> g{s){a) whenever g{s){a) G {_L,T}. Note however that _L S and T 5 

and thus will have no defined transitions. As before, let us first present the corresponding 
automata for simple expressions - 0, a(/[*]), a(0) and «(/[*]) © &(^*])- 



a,b 



_L 



_L 




a{l[*]) ® b{l[*]) 



Note how _L is used to encode underspecification, working as a kind of deadlock state. In 
the first three expressions the behaviour for one or both of the inputs is missing, whereas 
in the last expression the specification is complete. 

The element T is used to deal with inconsistent specifications. For instance, consider 
the expression a(^[*]) © ?)(^[*]) © a(r[a(/[*]) © &(^*])]). All inputs are specified, but note 
that at the outermost level input a appears in two different sub-expressions - a(^*]) and 
a(r[a(/[*])©6(/ [*])]) - specifying at the same time that input a leads to successful termination 
and that it leads to a state where a(^[*]) © &(^[*]) holds, which is contradictory, giving rise 
to the following automaton. 



a(/M) ® K^M) ® a(r[a(/H) e b{l[*])]) 



-> T 



4. A SOUND AND COMPLETE AXIOMATIZATION 

In the previous section, we have shown how to derive from the type of a system, given 
by a functor G, a language Expg that allows for specification of ^-behaviours. Analogously 
to Kleene's theorem, we have proved the correspondence between the behaviours denoted 
by Expg and locally finite ^-coalgebras. In this section, we will show how to provide Expg 
with a sound and complete axiomatization. Again, the functor G will serve as a main guide 
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for the definition. The defined axioniatization is closely related to Kleene algebra (the set of 
expressions has a join semilattice structure) and to the axioniatization provided by Milner 
for CCS (uniqueness of fixed points will be required). When instantiating the definition 
below to concrete functors one will recover known axiomatizations, such as the one for 
CCS mentioned above or the one for labelled transition systems (with explicit termination) 
presented in [1] . The latter will be discussed in detail in Section \5\ 

Next, we introduce an equational system for expressions of type T <]Q. We define the 
relation = C Expj-^g x Expjr^g, written infix, as the least equivalence relation containing 
the following identities: 



(1) (Expjr<]g,©,0) is a join-semilattice. 






I e = e 
B(e2 



£2^ 
5 63) 



•£i 



(ei © £2) © £3 



(2) fi is the unique fixed point. 

"ylfix.^/x] = fix.'y 
^[e/x] = e =^ fj.x.'y = e 

(3) The join-semilattice structure propagates through the expressions. 



(Idempotency) 
( Commutativity) 
(Associativity) 
(Empty) 

(FP) 
( Unique) 



= U 



(B 

fx 



0) 



r(0) 



0) 



L) 
R) 



hi®h2 

l{ei®e2) 

r{ei®e2) 

a(£i ©£2) 

/[£i©e2] 

r[ei©e2] 

l[ei]®r[e2] 



hi Vb 62 

/(ei)©/(e2) 
r(ei)©r(e2) 
a(ei) ©a(e2) 

/[ei]©/N 

r[ei]®r[e2] 
/l]©r[0] 



)) 



L) 
R) 

L) 
R) 

T) 



(4) = is a congruence. 

£1 = S2 =^ e[ei/x] = e[e2/x] for x free in e (Cong) 

(5) Q-equivalence 

fj.x.'j = fiy.'y[y/x] for y not free in 7 (a — equiv) 

It is important to remark that in the third group of rules there does not exist any rule 
applicable to expressions of type T^T. 

Example 4.1. Consider the non-deterministic automata over the alphabet A = {a}: 




si 



S2 



S3 



Applying (( — )) (as defined in the proof of Theorem I3.12p one can easily compute the 
expressions corresponding to si and S2- 

£1 = (( si )) = nxi.l{0) © r{a({xi})) 

£2 = (( S2 )) = m-^(O) © r{a({fiy2.m © r{a({fiyi.l{0) © r {a({y2}))}))})) 
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We prove that £2 = ei- In the following calculations let e = /ixi.r(a({xi})). 

£2 = ei 

^ r{ai{fiy2.r{ai{r{a{{y2}))}))})) =e ((B - 0), (x - - L), (FP) and (Empty)) 

<^ fiy2-r{a{{r{a{{y2}))})) = e {(PP) on e and (Cong) twice) 

<^ r{a{{r{a{{e}))})) = e (uniqueness of fixed points) 

<^ r{a{{e})) = e (fixed point axiom) 

<^ £ = e (fixed point axiom) 

Note that the (Cong) rule was used in almost every step. 

For another example, consider the non-deterministic automaton over the alphabet A = 
{a,b}: 

Using the definition of (( — )) one can compute the following expressions for si, S2, S3 and 

54: 

£i = (( si )> = ^xi./(0) e r{a{{e2}) 6({e2})) 

£2 = {{S2)) =^X2./(O)©0 

£3 = (( S3 )) = ^X3i(0) © r{a{{£2}) © b{{£2} © {£4})) 

£4 = ((S4» =AiX4i(O)©0 

For £2 we calculate: 

£2 = /(O)©0 {FP) 

= /(0) {Empty) and {B - ^) 

= (x-0-L) 

Similarly, one has that £4 = 0. Next, we prove ei = £3: 

£1 = £3 

^ m(Br{a{{£2})(Bb{{£2}))=l{0)(Br{a{{£2})(Bb{{£2}(B{£4})) (FP) 

4^ /(O)©r(a({0})©6({0})) = /(O)©r(a({0})©6({0}©{0})) (£2 = = 64) 

^ l{0) © r{a{{^}) © 6({0})> = ^(0) r{a{{^}) © 6({0})> (Idempotency) 

The relation = gives rise to the (surjective) equivalence map [— ] : Expjr^g — ;■ Expjr^g/= 
defined by [£] = {£' | £ = £'}. The following diagram summarizes the maps we have defined 
so far: 

ExPj-<e; > Expjr^g/= 

•^(Exp^<jg) ^^ T{Expg/^) 

In order to complete the diagram, we next prove that = is contained in the kernel of 
J'{[—]) o Sj^<ig. This will guarantee the existence of a well-defined function 

djr^g: Expjr^g/= -^ J"(Expg/=) 

which, when F = Q, provides Expg/= with a coalgebraic structure dg : Expg/= — )■ Q{Expg/=,) 
(as before, we write dg to abbreviate 5g<]g) and which makes [— ] a homomorphism of coal- 
gebras. 



NON-DETERMINISTIC KLEENE COALGEBRAS 



23 



Lemma 4.2. Let Q and J- he non- deterministic functors, with T <\Q. For all ei,e2 £ 

Expjr^g with £i =62, 

Proof. By induction on the length of derivations of =. 

First, let us consider derivations of length 1. We need to prove the result for all the 
axioms in items 1. and 3. plus the axioms FP and (a — equiv). 

For the axioms in 1. the result follows by Lemma 13.131 The axiom FP follows trivially 
because of the definition of 5g, since 5g{fj,x.^) = 5g{'y[fix.j/x]) and thus Q{[—])o6g{fix.^) = 

0{[-])°^gii[i^x.j/x]). 

For the axiom (a — equiv) we use the (Cong) rule, which is proved below: 



Qii- 
Gii- 



o6g{fiX.'j) 

o6g{j[fIX.^/x]) (def. of^g) 

°Sg{l[lJ'y-l[y/x]/x]) {by (Cong)) 

° ^g{l[y/x][fiy-7[y/x]/y]) {A[B[y/x]/x] = A[y/x][B[y/x]/y], y not free in 7) 

o 6g{ny.'y[y/x])) (def. oiG{[-])o6g) 



Let us next show the proof for some of the axioms in 3.. The omitted cases are similar. 
We show for each axiom ei = £2 that (5j-<g(ei) = (Jj"<ig(e2)- 



U 



61 © 62 = h Vb 62 



JB<g{-^B 



(^E 



±F 



5B<]glwj 



<5B<g(fei Vb 62) = h Vb &2 = ^B<g{bi © 62 



m 



STixT2<g{K^)) = (Empty ^^^g, Empty j-^^g) = ^j-^xj-a^g® 



K^i ©^2) = liei)elie2) 



'5j-ixj-2<ie(K£i©e2)) 

{STi<g{£:i © £2), Empty j-^^g)) 

(Plusj-,<]g(5j-,<g(ei), 5j-i<g(e2)), Plusj-2<]g(Emptyj-2^g, Empty ^^^e))) 

Plusj-ixj-2(('5j-i<]g(ei), Emptyj-^^g), ((5j-i<ig(e2), Empty j-^^g) 

s^,x^2<g{K^i) ® K£2))) 



/[eiffie2] = l[si]@l[e2] 



^J'l^^J^aoe 



(/[ei©e2]) 



KiiSj^,<ig{ei ©£2)) 

P' us j-i ^ j-2 (^1 (<Jj^i<]e (^1 ) ) ; /^i ((Jj^i <e (£2 ) ) 



^j-i<^j^2<ie 



G[ei]©/N) 



l[ei]er[e2] = lM®r\§ 



Plus 



T 



jr^<^jr^{Kl{5jr^^g{£l)),K2{5jr^^g{e2))) 



PluSj-i^J-2 (/^l(<^^l<e(0)), K2((5^2<e(i 



^Ti<^T2<g 



(/i]©r[0]) 
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Note that if we would have the axioms ^[0] = and r[0] = in the axiomatization 
presented above, this theorem would not hold. 

h,^r,<gm) = '^i(W) / ^ = h,^T,<Gm 

h.^r,<girm) = ^2{[1-]) / ^ = ST,^T,<gi^) 
Derivations with length A; > 1 can be obtained by two rules: (Unique) or (Cong). For the 
first (which uses the second), suppose that we have derived /ix.7 = £ and that we have 
already proved 7[e/x] = e. Then, we have: 

G{[-])o6g{fiX.j) = G{[-])o6g{-f[flX.j/x]) {del 6g) 

= QiH)oSgh[e/x]) (by (Cong)) 

= 0{[—])oSg{e) (induction hypothesis) 

For (Cong), suppose that we have derived e[ei/x] = £[£2/2^2] and that we have already 
derived ei = £2, which gives us, as induction hypothesis, the equality 

(-^[-])(<^^<g(^i)) = (-^[-])('^^^e(£2)) (4.1) 

This equation is precisely what we need to prove the case £ = x (and thus ei,e2:^ <1 G)- 
{0[-]){hix[ei/x]) = {G[-]){5g{£i)) 

= m-])isgi£2)) dUD 

= {0[-]){hix[e2/x])) 

For the cases £ ^ x, we prove that (^j-<g(e[ei/a;]) = 6j^<^g{£[£2/x]), by induction on the 
product of types of expressions and expressions (using the order defined in equation (|3.1|) ). 
We show a few cases, the omitted ones are similar. 

Sg<g{{fJ-y-e)[£i/x]) = 6g<tg{£[£i/x][fiy.£/y])) 

( IH) 

= 6g<tg{£[£2/x][fiy.£/y])) = Sg^g{{ny.£)[£2/x]) 
STixT2<g{l{£)[£i/x]) = (<5j-i<]g(e[ei/x]). Empty ^2^g;) 

{ Tlf] 

= {^Ti<g{£[£2/x]), Empty^^^g) = Sjr^^jr^^g{l{£)[£2/x]) 
'^J^1'^J-2<]S(^N[^1/^]) = Kl{6jr,^g{£[£i/x])) 

= ni{5T^<ig{e[£2/x\)) =<^j-,^^2<]e(^NN/2;]) 

D 

Thus, we have a well-defined function 9j-<]g : Expjr^g/= — )■ J^(Expg/=), which makes the 
diagram above commute, that is 5j-<g([e]) = (-^[— ])o5jr^g(e). This provides the set Expg/= 
with a coalgebraic structure dg: Expg/= — >• ^(Expg/=) which makes [— ] a homomorphism 
between the coalgebras (Expg,(5g) and (Expg/=,9g). 

4.1. Soundness and Completeness. Next we show that the axiomatization introduced 
in the previous section is sound and complete. 

Soundness is a direct consequence of the fact that the equivalence map [— ] is a coalgebra 
homomorphism. 

Theorem 4.3 (Soundness). Let Q he a non- deterministic functor. For all £±,£2 G Expg, 

ei = £2 =^ £1 ~ £2 
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Proof. Let t/ be a non-deterministic functor, let £1,62 G Expg and suppose that ei =62- 
Then, [ei] = [^2] and, thus 

where beh^ denotes, for any ^-coalgebra {S,g), the unique map into the final coalgebra. 
The uniqueness of the map into the final coalgebra and the fact that [— ] is a coalgebra 
homomorphism implies that behExpj,/= ° [—] = behExp^ which then yields 

behExpg(ei) = behExpg(e2) 

Since in the final coalgebra only the bisimilar elements are identified, ei ~ £2 follows. □ 

For completeness a bit more of work is required. Let us explain upfront the key steps 
of the proof. The goal is to prove that ei ~ £2 =^ ^i = £2- First, note that we have 

£1 ~ £2 <^ behExpg(£i) = behExpg(£2) ^ behExpg/^([£i]) = behExpg/^([£2]) (4.2) 

We then prove that behExp^ /= is injective, which is a sufficient condition to guarantee that 
£1 = £2 (since it implies, together with (14. 2p . that [£i] = [£2])- 

We proceed as follows. First, we factorize the map behExp^/^ into an epimorphism 
followed by a monomorphism [311 Theorem 7.1] as shown in the following diagram (where 
/ = behExpg/,(Expg/=)): 

Expg/= » /C ^^— t ng 



dg 



OJg 



UJQ 



g(Expg/,) — >g{i) — >g{ng) 

Then, we prove that (1) (Expg/=, dg) is a locally finite coalgebra (Lemma l4.4p and (2) both 
coalgebras (Expg/=,5g) and {I,LJg) are final in the category of locally finite ^/-coalgebras 
(Lemmas 14.71 and 14.81 respectively). Since final coalgebras are unique up to isomorphism, it 
follows that e: Expg/= — )• / is in fact an isomorphism and therefore behExp^/= is injective, 
which will give us completeness. 

In the case of the deterministic automata functor P = 2 x Id , the set / will be 
precisely the set of regular languages, the class of languages that can be denoted by regular 
expressions. This means that final locally finite coalgebras generalize regular languages (in 
the same way that final coalgebras generalize the set of all languages). 

We proceed with presenting and proving the extra lemmas needed in order to prove 
completeness. We start by showing that the coalgebra {Expg/=,dg) is locally finite (note 
that this implies that {I,iJg) is also locally finite) and that dg is an isomorphism. 

Lemma 4.4. The coalgebra {Expg/=,dg) is a locally finite coalgebra. Moreover, dg is an 
isomorphism. 

Proof. Local finiteness is a direct consequence of the generalized Kleene's theorem (Theo- 
rem [3Tl]). In the proof of Theorem 13.141 we showed that, given e G Expg, the subcoalgebra 
{W\acie) is finite. Thus, the subcoalgebra ([£]) is also finite (since Expg/= is a quotient of 

Expg/=^C/i;)- 
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To see that dg is an isomorphism, first define, for every J- <Q, 

d~j:\g{c) = m (4.3) 

where 7;?^ is defined, for J- ^ Id, as 7^ in the proof of Theorem 13.121 and for J^ = Id as 
7K = £. Then, we prove that d'^^g has indeed the properties (l) d^^^g °dT<g = id^^p /^ 
and (2) dj^^g o djr^g = «'^j-(Exp,r c/=)- Instantiating F = Q one derives that 5g is an 
isomorphism. It is enough to prove for (T) that 7^^ ,,v. = e and for (2) that dj^g(^-^\) = c. 
We illustrate a few cases. The omitted ones are similar. 

(T) By induction on the product of types of expressions and expressions (using the order 
defined in equation (j3.ip ). 

X- V X- T T (IH) 



7^ix-^2 =/(^i ^ffl^)er(7^2 , ,) = imer(e)=r(e) 

-g _ _g ({^) r , , 

^dgilfix.e]) - ^dg{[e[iJ.x.e/x]]) " e[^2;.e/XJ _ /iX.e 



Note that the cases e = and e = ei ® £2 require an extra proof (by induction on J^). More 
precisely, one needs to prove that 

It is an easy proof by induction. We illustrate here only the cases J-" = Id, J-" = B and 
F = FiX F2- 

@ 7|^=0 

7f±B] = ^B - 

''=^ im®rm = $ 
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7f,;,VBX2] = ^1 Vb X2 = XI ® X2 = 7f^,] ® 7f^2] 

^J^lXj^2 ^ 7^.^lX.^2 

'j"lXj"2[-](PluSjrj^xJ^2<5(^"l''"l)'^'"2,'U2>)) '(PluSjrj^(Mi,t;i),PluS;r2('"2,'y2)> 



~ 'wp|usjrj(ui,»;i)/ ® ^wp|us^2("2,t'2)' 

^ 7^.^lXJ'2 7r7-^lXjC-2 
' (u\,U2) '(til ,5^2) 
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By induction on the structure of T . 



d 



j"i<^j"2<e; 



([t 



■J-l^J-21 



d 

d 






m) = ^ 



AC2(C') 



(IH) 



di2T<gimn) 



(IH) 



\dT2T<gm,^c^c']) = {dT<g{[l^']) \ c G C} otherwise 
C 



D 



Next, we prove the analogue of the following useful and intuitive equality on regular 
expressions. Given a deterministic automaton (o, t) : 5 — t- 2 x S and a state s G 5", the 
associated regular expression r^ can be written as 



o('S) + ^a- r^s)ia) 



(4.4) 



aeA 



using the axioms of Kleene algebra |131 Theorem 4.4]. 

Lemma 4.5. Let {S,g) be a locally finite Q-coalgehra, with Q / Id, and let s G S, with 
(s) = {si, . . . , Sn} (where si = s). Then: 

^^^^)) = 7,%){((«i))M} ■■■{(( «n))/x„} (4.5) 

Proof. Let A^, where i and k range from 1 to n, be the terms defined as in the proof of 
Theorem Ela Recah that (( Si )) = ^f . We calculate: 

(( S^ )) 

= Af 

= (^^i-7,%))KM} • • • {^r V^n} 

= ^^xUl%^){AVx^} . . . {A^/x,.2}{Ai^i/x.+i} . . . {^rV^n}) 

= 7j(^^){A?/xi} . . . {4zl/xi.2}{Ai+Jx,+i} . . . {A^-^/xn}{A^/xi} (fixed point axiom^) 

= 7^%){^;/xi} . . . {A^-'/x^} (by[32D 

= j^^^^^^{A'i{Al/x2} . . . {^rV^n}/xi} . . . {^rV^n} (by[33D 

= 7f(^^){AVxi}{Ai/x2} . . . {^rV^n} (def. A-,) 



: (repeat last 2 steps ior A^, ■ ■ ■ , A^_i) 



(^n-1 — ^n) 



D 



Note that the fixed point axiom can be formulated using syntactic replacement rather than substitution 
j{nx.^/x} = iix.'y - since jj,x.^ is a closed term. 
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Instantiating (j4.5p for {o,t): S — )• 2 x S , one can easily spot the similarity with 
equation (|4.4p above: 

{{s)) ^l{o{s)) ®r(^a{{{t{s){a))))) 

Next, we prove that there exists a coalgebra homomorphism between any locally finite 
G- coalgebra {S,g) and {Expg/=,dg). 

Lemma 4.6. Let {S,g) be a locally finite Q-coalgehra. There exists a coalgebra homomor- 
phism [ — ] : S" — )• Expg/=. 

Proof. We define [ — ] = [—] o (( — )), where (( — )) is as in the proof of Theorem 13.121 
associating to a state s of a locally finite coalgebra an expression ((s)) with s ~ ((s)). To 
prove that [ — ] is a homomorphism we need to verify that (Q\ — A^) o g = dg o [ — ]. 

li Q = Id, then (^[ — ]) o g[si) = [0] = dg{\ Si]). For ^ / Id we calculate, using 
Lemma 14.51 

9g(r^.l) = 9g([7,%)[((^i))M]---[((^n))/xn]]) 

and we then prove the more general equality, for J- <\G and c G J~{s)'- 

dT<,g{bn{{si))/x^]...[{{s^))/xr.]])=T\-Uc) (4.6) 

The intended equality then follows by taking F = Q and c = g{si). Let us prove the 
equation (|4.6p by induction on F. 

C= Sj ^ {s) 

5id<g([7i1 K( si ))/xi] ...[(( .„ ))/xn]]) = [{{ s, ))] = \s,] 



F=\d 



F= B c = 6g B 

9B<g([7fK(si))/xi]... [((«„)) /x„]]) = [b] = B\-m 



F = Fi X F2 c = (ci, C2) G {Fi X F2){s) 



(IH) 



9^xx.F.<g([7gS^[(( si ))/xi] ...[(( .„ ))/x„]]) 

5-F,x^,<e([/(7c^^) e r(7,^^)[((si ))/xi] . . . [((s„))/x„]]) 

(5^.^g([7c^^ [(( si ))/x,] ...[{{ .„ ))/x„]]), a^,^g([7,^^ [(( .1 ))/xi] ...[(( s„ ))/x„] 

(-Fir-i(ci),j-2r-i(c2)) 
(-Fixj-2r-i)(c) 



J- = J-1 ^ J-2 



F = F^ 



and 



J^ = 'S-^i : similar to Fi x F2. 



D 



We can now prove that the coalgebras (Expg/=, 5g) and (/, ujg) are both final in the category 
of locally finite ^-coalgebras. 



Lemma 4.7. The coalgebra (I,u}g) is final in the category Coalg{Q)i,Y- 
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Proof. We want to show that for any locally finite ^-coalgebra {S,g), there exists a unique 
homomorphism {S,g) — )• {I,uJg). The existence is guaranteed by Lemma 14.61 where the 
homomorphism [ — ] : S — )• Expg/= is defined. Post-composing this homomorphism with 
e (defined above) we get a coalgebra homomorphism eo[ — ]:5— )•/. If there is another 
homomorphism /: S" — )• /, then by post-composition with the inclusion m: I ^^ Q we get 
two homomorphisms {mo f and moeo [ — ] ) into the final ^-coalgebra. Thus, / and eo [ — ] 
must be equal. D 

Lemma 4.8. The coalgebra (Expg/=,9g) is final in the category Coalg{Q)i,Y- 

Proof. We want to show that for any locally finite ^-coalgebra (S,g), there exists a unique 
homomorphism {S,g) — )• (Expg/=,9g). We only need to prove uniqueness, since the exis- 
tence is guaranteed by Lemma \^M where [ — ] : 5 — )• Expg/= is defined. 

Suppose we have another homomorphism /: 5" — )• Expg/=. Then, we shall prove that 
/ = [ — ]. Let, for any s £ S, fs denote any representative of f{s) (that is, f{s) = [fs])- 
First, observe that because / is a homomorphism the following holds for every s € S: 

f{s) = {dg' o g{f) o g){s) ^f,^ l%)[fsjxi] . . . [fsJXn] (4.7) 

where (s) = {si, . . . ,s„}, with si = s (recall that dg was defined in ()4.3p and note that 

Next, we prove that /s- = (( Sj )) (which is equivalent to f{si) = \ Si~\), for all i = 
1, . . . n. For simplicity we will here prove the case n = 3. The general case is identical but 
notationally heavier. First, we prove that /^^ = Ai[fs2/x2][fs3/x3]. 

fsi = I%,)[fsr/Xl][fsjx2][fsjxs] (by (gZl)) 

^ f-'^i = ^%i)if^2/x2][fs-jx3][fsjxi] (all /(si) are closed) 

=^ fs^ = IJ'Xi.-f^(^^_^-j[fs2/x2][fs-Jx3] (by uniqueness of fixed points) 

^ fsi=Ai[fsJx2][fsJx3] (def. ofAi) 

Now, using what we have computed for /^^ we prove that /sj = A^lfs-^/xs]. 

fs2 = I%,)[fsjxi][fsjx2][fsjx3] (by dlZD) 

^ fs2 = l^g(^s2)^Ai/xi][fs2/x2][fs3/x3\ (expressions for /^^ and dSD) 
^ fs2 = l%2)[Ai/xi][fsJx3][fs2/x2] (ah f{si) are closed) 
=^ fs2 = /ux2.7g(s2)[^i/a:i][/s3/2;3] (by uniqueness of fixed points) 

^ fs2^A\[f,Jx3] (def. of^^) 

At this point we substitute /sj in the expression for fs^ by ^2[/s3/^3] which yields: 

/,, = AMl[fsJx3\/x2][fsJx3\ ^ AMI/X2][fs,/X3] 

Finally, we prove that /s3 = y4|: 

/.3 = I%,)[fsjxi][fs2/X2][fsjx3] (by g2D) 

^ ^^3 = l%,)[Ai/xi][A\/x2][fsJx3] (expr. for f{si) and ^1) 

=^ /s3 = /ux3.7^(.g, JAi/xi] [742/3:2] (by uniqueness of fixed points) 

^ fs.^Al ' (def. of^i) 
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Thus, we have fs^ = Ai[Al/x2][Al/x3], fs^ = A^lAl/xs] and fs^ = A^. Note that 
^M^i/^s] = ^21^3/^3} since X2 is not free in A^. Similarly, since xi is not free in A2 
and Al, we have that ^i[A^/x2][A|/x3] = Ai{Al/x2}{Al/x3}. Thus /(sj) = \ Si~\, for ah 
i = 1,2,3. n 

As a consequence of Lemma [4.81 we have that if Qi and Q2 are isomorphic functors then 
E.xpg_^/= and Expg^/= are also isomorphic (for instance, this would be true for Gi{X) = 
Bx {X X A) and g2iX) = A x {B x X)). 

We remark that Lemma 14.71 could have been proved as a consequence of Lemma 14.81 
by observing that {I,uJg) is, by construction, a quotient of (Expg/=,5g). 

At this point, because final objects are unique up-to isomorphism, we know that 
e: Expg/= — 7- / is an isomorphism and hence we can conclude that the map behExp„/= 
is injective, since it factorizes into an isomorphism followed by a mono. This fact is the last 
thing we need to prove completeness. 

Theorem 4.9 (Completeness). LetG be a non- deterministic functor. For aU£i,£2 G Expg, 

ei ~ £2 =^ ^1 = £2 

Proof. Let ^ be a non-deterministic functor, let £1,62 £ Expg and suppose that ei ~ £2- 
Because only bisimilar elements are identified in the final coalgebra we know that it must 
be the case that behExpg(£i) = behExpg(e2) and thus, since the equivalence class map [— ] 
is a homomorphism, behExpQ/=([£i]) = behExp„/^([e2])- Because beh^xp^/^ is injective we 
have that [ei] = [£2]- Hence, ei = £2- D 

5. Two MORE EXAMPLES 

In this section we apply our framework to two other examples: labelled transition 
systems (with explicit termination) and automata on guarded strings. These two automata 
models are directly connected to, respectively, basic process algebra and Kleene algebra 
with tests. To improve readability we will present the corresponding languages using a 
more user-friendly syntax than the canonically derived one. 

Labelled transition systems. Labelled transition systems (with explicit termination) are 
coalgebras for the functor 1 -|- (^Id) . As we will show below, instantiating our framework 
for this functor produces a language that is equivalent to the closed and guarded expressions 
generated by the following grammar, where a £ A and x G X {X is a set of fixed point 
variables) : 

P:■. = 0\P + P\a.P\6\^/\ fix.P \ x 
together with the equations (omitting the congruence and a-equivalence rules) 

Pl+P2 = P2 + Pi Pi + (P2 + P3) = {Pi + P2) + P3 

P+P=P P+0=P 

P + 6 = P{i.) V + ^ = V + Pi*) (*) ifP^OandP^ V 

P[^x.P/x] = fix.P P[Q/x] = Q^ inx.P) = Q 

Note that, as expected, there is no law that allows us to prove a.{P + Q) = a.P + a.Q. 
Moreover, observe that this syntax and axiomatization is very similar to the one presented 
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in [T]. In the syntax above, 6 represents deadlock, -y/ successful termination and the 
totally undefined process. 

We will next show how the beautified syntax above was derived from the canonically 
derived syntax for the expressions e S Expl_^,■2^^\A, which is given by the set of closed and 
guarded expressions defined by the following BNF: 

e:: = I e ® e I X I fix.e \ l[ei] \ r[e2] 
ei:: = | ei © ei | * 
ei:: = j £2 © ^2 I a{e') 
£>:: = %\£' @e'\{e] 

We define two maps between this grammar and the grammar presented above. Let us start 
to show how to translate P's into e's, by defining a map (— )^ by induction on the structure 
of P: 





(^) = ^ fa Pit 

(Pi + P2)t = (Pi)t©(P2)t °;(;^ 

(/xx.P)t = ^x.Pt vJ 
xt = X 


= r[ai{P^)] 
= rM 


now the converse translation: 




(0)* 


= mf 


= V 


(ei©e2)* 


= (ei)* + (e2)* (r[0])t 


= 6 


(/ix.e)* 


= /ix.et (r[e2©e2])* 


= {r[e,])t + {r[e',])t 


X* 


= X (r[a(0)])t 


= 6 


m)^ 


= V {r[a{e',®e'M 


= ir[aie[)])t + (r[a(4)])t 


{l[ei®e',]f 


= {l[e,])t + (l[e[])t (r[a({e})])t 


= a.si 



One can prove that if Pi = P2 (using the equations above) then (Pi)^ = (P2)t (using the 
automatically derived equations for the functor) and also that ei = £2 implies (ei)"'" = {£2)^- 

Automata on guarded strings. It has recently been shown [23] that automata on guarded 
strings (acceptors of the join irreducible elements of the free Kleene algebra with tests on 
generators S,r) are coalgebras for the functor B x Id *^ , where At is the set of atoms, 
i.e. minimal nonzero elements of the free Boolean algebra B generated by T and S is a set 
of actions. Applying our framework to this functor yields a language that is equivalent to 
the closed and guarded expressions generated by the following grammar, where 6 G B and 
aG S: 

P::=0\{b)\P + P\b^a.P\ jix.P \ x 

accompanied by the equations (omitting the congruence and a-equivalence rules) 

Pi + P2 = P2 + Pi Pi + (P2 + P3) = (Pi + P2) + P3 

P+P=P P+0=P 

(6i> + (62) = (61 Vb 62) 0=(±b) 

(6^a.0) = (_LB^a.P) = 

(6 ^ a.P2) + {b^ a.P2) = h -^ a.(Pi + P2) (61 ^ a.P) + (62 ^ a.P) = (61 Vb 62) ^ a.P 

P[fix.P/x\ = fix.P P[Q/x] = Q^ ifix.P) = Q 

We will not present a full comparison of this syntax to the one of Kleene algebra with 
tests [23j (and propositional Hoare triples). The differences between our syntax and that 
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of KAT are similar to the ones between regular expressions and the language Exp-p for 
the functor representing deterministic automata (see Definition I3.5p . Similarly to the LTS 
example one can define maps between the beautified syntax and the automatically generated 
one and prove its correctness. 

6. Polynomial and finitary coalgebras 

The functors we considered above allowed us to modularly derive languages and ax- 
iomatizations for a large class of coalgebras. If we consider the subset of NDF without 
the "S functor, the class of coalgebras for these functors almost coincides with polynomial 
coalgebras (that is, coalgebras for a polynomial functor). The only difference comes from 
the use of join-semilattices for constant functors and ^ instead of the ordinary coproduct, 
which played an important role in order for us to be able to have underspecification and 
overspecification. We will next show how to derive expressions and axiomatizations directly 
for polynomial coalgebras, where no underspecification or overspecification is allowed. 

Before we show the formal definition, let us provide some intuition. The main changeqj, 
compared to the previous sections, would be not to have and © and consider an expression 
(— , — ) for the product instead of the two expressions l{—) and r(— ) which we considered 
and an expression (ai(— ), a2(— ), . . . , a„(— )) for the exponential (with A = {ai, . . . an})- As 
an example, take the functor 'D{X) = 2 x X of deterministic automata. The expressions 
corresponding to this functor would then be the set of closed and guarded expressions given 
by the following BNF: 

£:: = x\ nx.e \ (0, (ai(e),a2(e), . . . ,a„(e))) | (1, (ai(e), 02(5), . . . ,a„(e))) 

This syntax can be perceived as an explicit and complete description of the automaton. This 
means that underspecification is nonexistent and the compactness of regular expressions is 
lost. As an example of the verbosity present in this new language, take A = {a, b, c} and 
consider the language that accepts words with only a's and has at last one a (described 
by aa* in Kleene's regular expressions). In the language Exp^ it would be written as 
fj,x.a{l{l) © x). Using the approach described above it would be encoded as the expression 

^x.{0, (a((l, {a{x), b{empty) , c{e'mpty)))) , b{empty), c{empty))) 

where empty = ny.{0, {a{y) , b{y) , c{y)) is the expression denoting the empty language. The 
approach we presented before, by allowing underspecification, provides a more user-friendly 
syntax and stays close to the know syntaxes for deterministic automata and LTSs. 

In what follows we will formally present a language for polynomial coalgebras. We start 
by introducing the definition of polynomial functor, which we take from [2j. 

Definition 6.1 (Polynomial Functor). Sums of the Cartesian power functors are called 
polynomial functors: 

Here, ]J stands for ordinary coproduct and the indexing set S is a signature, that is a 
possibly infinite collection of symbols a, each of which is equipped with a finite cardinal 
ar{a), called the arity of a. Jl» 



This syntax was suggested to us by B. Klin, during CONCUR'09. 
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Definition 6.2 (Expressions and axioms for polynomial functors). Let Ps be a polynomial 
functor. The set Expp^ of expressions for Ps is given by the closed and guarded expressions 
generated by the following BNF, where a £ T, and x £ V, for V a set of fixed point variables: 

Ei-.: = X I fix.e I a{ei,...,ear(a)) 

accompanied by the equations: 

'ylfix.j/x] = fix-j {FP) 

7[e/x] = e =^ fix.j = e ( Unique) 

ei = £2 =^ e[ei/x] = e[£2/x], if x is free in e (Cong) 

/ix.7 = jiy.^[y/x\, if y is not free in 7 [a — equiv) 

Providing the set Expp with a coalgebraic structure is achieved using induction on the 
number of unguarded occurrences of nested fixed points: 

<5:Expp^^ U(ExppJ-('^) 

6{nx.e) = 5{e[fix.e/x]) 

d{a{£i,...,ear{a))) = K^{{£l, ■ ■ ■ jSar^a))) 

We are now ready to state and prove Kleene's theorem. 

Theorem 6.3 (Kleene's theorem for polynomial functors). Let P^ be a polynomial functor. 

(1) For every locally finite coalgebra {S,g: S* — )• Ps(5')) and for every s £ S there exists an 
expression e £ Expp^, such that e ~ s. 

(2) For every expression e £ Expp^, there is a finite coalgebra {S,g: S ^ Pe(S')) with s £ S 
such that s ~ e. 

Proof. Point 1. amounts to solve a system of equations. Let (s) = {si, . . . , s„}. We associate 
with each Si £ (s) an expression (( Sj )) = Af, where Af is defined inductively as in the proof 
of Em with ^f+i = A^{Al^Jxk+i} and ^^ = Ai given by 

Ai = fiXs,.a{x,:^,...,x^:^^^^^), g{si) = «^<7(si, • • • , sI^h) 

It remains to prove that Sj ~ (( Sj )), for all Si £ (s). We observe that 

R = {{S^A{S^)))\S^£{S)} 

is a bisimulation, since, for g{si) = k^^^s'i, . . . , s'^^(^\), we have 

msi))) 

= 6{i^iX^.aix,> X,> )){^?/xi}...{^rV^n}) 

i ar{a) 

= 6{nxi.a{xs'^,. . . , x,,^^^^^){Al/xi] . . . {A^r\/x,_i]{A^-_^Jxi+i] . . . {A'^-^/xn}) 
= 5{a{x,,^,. . . ,x,.^^^p{^?/xi} . . . {^^:2/x,_i}{A^+iM+i} . . . {Al-^/xn}[A^/x,]) 

^t\/x^-i}{A\^Jx,+^ . . . {A^-yxn}{A^/x,}) 
^'-_l/x,^^}{Af/x,}{AUi/x,+i} . . . {ArV^n}) 



6{a{x,> ...,x,, ){Al/xi}...{A. 

i- ar{a) 

6{a{xs',...,x,> ){Ayxi}...{A. 

1 ar{a) 

k.((( .;)),..., ((<,(^)))) 
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For point 2, we observe that the subcoalgebra (e), for any e G Expp^, is finite, since the 
set c/(e) containing all sub-formulas and unfoldings of fixed points of e, which is finite, is 
a subcoalgebra of (Expp , (5). The fact that in this point, contrary to what happened in 
Theorem 13.141 we do not need to quotient the set of expressions is a direct consequence of 
the absence of underspecification or, more concretely, of the expressions and ©. □ 

The proof of soundness and completeness would follow a similar strategy as in the 
previous section and we will omit it here. 

In order to be able to compare the language introduced in this section with the language 
obtained in our previous approach, we have to define an infinitary version of the operator ■$■ 
and extend the framework accordingly. We start by defining the aforementioned operator 
on sets: \Kig/Xj = (JJ^^j Xj) U {_L,T} and the corresponding functor, for which we shall 
use the same symbol, is defined pointwise in the same way as for '^. Note that ■$■ is a special 
case of this operator (resp. functor) for / a two element set. In fact, for simplicity, we shall 
only consider this operator for index sets / with two or more elements. 

There is a natural transformation between polynomial functors and the class of non- 
deterministic functors extended with \k: every polynomial functor P^ is mapped to 

Next, we slightly alter the definition of expressions. Instead of the expressions /[— ] and r[— ] 
we had before for ^ we add an expression i[—] for each i £ I and the expected typing rule: 

hj[e]: ^.e/7-,<e 

All the other elements in our story are adjusted in the expected way. We show what happens 
in the axiomatization. For ^ we had the rules 

l[ei e 82] = l[ei] ® l[e2] r[ei e £2] = r[ei] e r[£2] Z[ei] © r[e2] = ^[0] © r[0] 

which are now replaced by 

i[£i] ® i[£2] = i[ei ® £2] i[ei]®j[e2] = k\^]®lM, i^3,k^l 
It is natural to ask what is the relation between the sets of expressions Expp and 
Expp . The set Expp^ is bijective to the subset of Expp containing only fully specified 
expressions, that is expressions e for which the subcoalgebra (e) does not contain any state 
for which &p evaluates to _L and T. This condition is purely semantical and we were not 
able to find a purely syntactic restriction that would capture it. 

We next repeat the exercise above for finitary functors. A finitary functor F is a 
functor that is a quotient of a polynomial functor, i.e. there exists a natural transformation 
7]-. Ps — )• F, whose components rjx'- Ps(-'^) — ^ F(A') are epimorphisms. We define Expp = 
Expp^. 

Theorem 6.4 (Kleene's theorem for finitary functors). Let Y he a finitary functor. 

(1) Let {S, f) be a locally-finite F-coalgebra. Then, for any s G S, there exists an expression 
{{s)) £ Expp such that s ~ ((«))■ 

(2) Let £ £ Expp. Then, there exists a finite F-coalgebra {S, f) with s G S such that s ~ e. 

Proof. Let F be a finitary functor (quotient of a polynomial functor Ps). 
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(T) Let {S, f) be a locally finite F-coalgebra and let s £ S. We denote by T = 
{si,...,s„} the state space of the subcoalgebra (s) (with si = s). We then have that 
there exists an /'' making the following diagram commute: 



>T >S 



/« 



/ 



/ 



Ps(T)^^F(r) >F{S) 



We then build (( s )) w.r.t /^ just as in Theorem 13.121 (note that (T, /") is finite) and the 
result follows because (( s )) ~f s <^ (( s )) ~Pj, s (consequence of naturality). 

(2) Let e G Expp. By Theorem l3.14l there exists a finite Ps-coalgebra (S, f) with s € S 
such that s ^p^ £■ Thus, we take {S, rjs o /) and we have a finite F-coalgebra with s G 5 
such that e ~f s. D 

For the axiomatization a bit more ingenuity is required. One needs to derive which 
extra axioms are induced by the epimorphism and then prove that they are sound and 
complete. 

For instance, the finite powerset functor (which we included in the syntax of non- 
deterministic functors) is the classical example of a finitary functor. It is the quotient 
of the polynomial functor Pj^{X) = 1 + X + X^ + . . . (this represents lists of length n) 
by identifying lists that contain precisely the same elements (that is, eliminating repeated 
elements and abstracting from the ordering). 

The syntax for Expp^ is the set of closed and guarded expressions given by the following 
BNF: 

e:: = X \ fix.e \ i{ei, . . . ,ei), i £ 'N 
together with the axioms for the fixed point, {a — equiv) and (Cong). 

Taking into account the restriction mentioned we would have to include the extra ax- 
ioms: 

i{ei, ...,ei) = i{e[, . . . ,6^ if {ei, . . . Si} = {e[,... e'^} 

i{£i,e2,...,£i) = {i- l)(ei, £3, •••,£») if ei = £2 
In this case, one can see that this set of axioms is sound and complete, by simply proving, 
for Ps(X) = 1 + X + X"^ + . . ., Exppj,/= = Exp^/= (since we already had a language and 
sound and complete axiomatization for the 1^ functor). The restricted syntax and axioms 
needs to be derived for each concrete finitary functor. Finding a uniform way of defining 
such restricted syntax/axioms and also uniformly proving soundness and completeness is a 
challenging problem and it is left as future work. 

7. Discussion 

We presented a systematic way of deriving, from the type of a system, a language of 
(generalized) regular expressions and a sound and complete axiomatization thereof. We 
showed the analogue of Kleene's theorem, proving the correspondence of the behaviours 
captured by the expressions and the systems under consideration. The whole approach 
was illustrated with five examples: deterministic finite automata, partial deterministic au- 
tomata, non-deterministic automata, labelled transition systems and automata on guarded 
strings. Moreover, all the results presented in [9] for Mealy machines can be recovered as a 
particular instance of the present framework. 
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Iterative theories have been introduced by Elgot [15j as a model of computation and 
they formahze potentially infinite computations as solutions of recursive equations. The 
main example of an iterative theory is the theory of regular trees, that is trees which have 
on finitely many distinct subtrees. Adamek, Milius and Velebil have presented Elgot's work 
from a coalgebraic perspective [31 H], simplified some of his original proofs, and generalized 
the notion of free iterative theory to any finitary endofunctor of every locally presentable 
category. The language modulo the axioms we will associate with each functor is closely 
related to the work above: it is an initial iterative algebra. This also shows the connection 
of our work with the work by Bloom and Esik on iterative algebras/theories [5]. It would 
be interesting to investigate the connections with iterative algebras further. 

In ^20j, a bialgebraic review of deterministic automata and regular expressions was pre- 
sented. One of the main results of ^20j was a description of the free algebra and Brzozowski 
coalgebra structure on regular expressions as a bialgebra with respect to a GSOS law. We 
expect that this extends to our framework, but fully working this out is left as future work. 

In this paper we studied coalgebras for Set functors. It is an important and challenging 
question to extend our results to other categories. Following our work, S. Milius [27] has 
showed how to derive a language and sound and complete axiomatization for the functor 
M X Id in the category of vector spaces and linear maps. It would also be interesting to 
study functors over metric spaces [HH [l2] . 

In his seminal paper [21j, S. Kleene introduced an algebraic description of regular lan- 
guages: regular expressions. This was the precursor of many papers, including this one. 
Salomaa [33j presented a sound and complete axiomatization for proving the equivalence 
of regular expressions. This was later refined by Kozen in [22]: he showed that Salomaa's 
axiomatization is non-algebraic, in the sense that it is unsound under substitution of alpha- 
bet symbols by arbitrary regular expressions, and presented an algebraic axiomatization. 
In |28j, Milner introduced a set of expressions for finite LTS's and proved an analogue of 
Kleene's theorem: each expression denotes the behaviour of a finite LTS and, conversely, 
the behaviour of a finite LTS can be specified by an expression. He also provided an axiom- 
atization for his expressions, with the property that two expressions are provably equivalent 
if and only if they are bisimilar. 

Our approach is inspired by the work of Kleene, Kozen and Milner. For that reason, 
we have and © in the syntax of our expressions, which allow to have underspecification 
and overspecification. These features had to be refiected in the type of the coalgebras we 
are able to deal with: the class of functors considered include join-semilattices as constant 
functors and ^ instead of the ordinary coproduct, which has allowed us to remain in the 
category Set. The fact that underspecification and overspecification can be captured by a 
semilattice structure, plus the fact that the axiomatization provides the set of expressions 
with a join semilattice structure, hint (as one of the reviewers pointed out) that the whole 
framework could have been studied directly in the category of join-semilattices. This is 
indeed true, but, for simplicity, we decided to remain in the category Set. It is not clear 
how much could be gained by directly working on join semi-lattices. 

The connection between regular expressions and coalgebras was first explored in |32j . 
There deterministic automata, the set of formal languages and regular expressions are all 
presented as coalgebras of the functor 2 x Id (where A is the alphabet, and 2 is the 
two element set). It is then shown that the standard semantics of language acceptance of 
automata and the assignment of languages to regular expressions both arise as the unique 
homomorphism into the final coalgebra of formal languages. The coalgebra structure on the 
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set of regular expressions is determined by their so-called Brzozowski derivatives [13] . In the 
present paper, the set of expressions for the functor 2? (5) = 2 x S differs from the classical 
definition in that we do not have Kleene star and full concatenation (sequential composition) 
but, instead, the least fixed point operator and action prefixing. Modulo that difference, the 
definition of a coalgebra structure on the set of expressions in both [32] and the present paper 
is essentially the same. All in all, one can therefore say that standard regular expressions 
and their treatment in [32| can be viewed as a special instance of the present approach. 
This is also the case for the generalization of the results in [32j to automata on guarded 
strings [23]. Finally, the present paper extends the results in our FoSSaCS'08 paper [9], 
where a sound and complete specification language and a synthesis algorithm for Mealy 
machines is given. Mealy machines are coalgebras of the functor (B x Id) , where A is 
a finite input alphabet and B is a finite semilattice for the output alphabet. Part of the 
material of the present paper is based on two conference papers: our FoSSaCS'09 paper [TT] 
and our LICS'09 paper [lU]. 

In the last few years, several proposals of specification languages for coalgebras appeared 
[29l[3Ql[IS[l7l[l4l[7l[8l[34l[24]. Our approach is similar in spirit to that of [III [30l [H [34] 
in that we use the ingredients of a functor for typing expressions, and differs from [30^ [19] 
because we do not need an explicit "next-state" operator, as we can deduce it from the type 
information. The modal operators associated to a functor in ^30 [ ll9 [ [34j can easily be related 
with the expressions considered in our language. As an example, consider the expression 
{■7T2)[Ki]{a)-L, written in the syntax of [30], which belongs to the language associated with 
the functor 2 x (Id + 1) (the modal operator (a) is next operator associated with the identity 
functor). In our language, this would be represented by r(/[0]). 

Apart from [23], the languages mentioned above do not include fixed point operators. 
Our language of regular expressions can be seen as an extension of the coalgebraic logic 
of [7J with fixed point operators, as well as the multi-sorted logics of [34J, and it is similar 
to a fragment of the logic presented in [21]. However, our goal is rather different: we 
want (1) a finitary language that characterizes exactly all locally finite coalgebras; (2) a 
Kleene like theorem for the language or, in other words, a map (and not a relation) from 
expressions to coalgebras and vice- versa. Similar to many of the works above, we also derive 
a modular axiomatization, sound and complete with respect to observational equivalence. 
From the perspective of modal logic, the second half of Kleene's theorem, where we show 
how to construct a coalgebra from an expression, is the same as constructing a canonical 
model. In [33], the models presented for the multi-sorted logics are multi-sorted coalgebras, 
whereas here we remain in the world of coalgebras in the category Set, constructing, from an 
expression in Expg, for a given functor Q, a ^-coalgebra. Further exploring the connections 
with the approach presented in |34J is a promising research path, opening the door to 
extending our framework for more general classes of functors. 

In conclusion, we mention a recent generalization of the present approach: all the results 
presented in this paper can be extended in order to accommodate systems with quantities, 
such as probability or costs [6]. The main technical challenge is that quantitative systems 
have an inherently non-idempotent behaviour and thus the proof of Kleene's theorem and 
the axiomatization require extra care. This extension allows for the derivation of specifica- 
tion languages and axiomatizations for a wide variety of systems, which include weighted 
automata, simple probabilistic systems (also known as Markov chains) and systems with 
mixed probability and non-determinism (such as Segala systems). For instance, we have 
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derived a language and an axiomatization for the so-called stratified systems. The language 
is equivalent to the one presented in [16], but no axiomatization was known. 

The derivation of the syntax and axioms associated with each non-deterministic functor 
has been implemented in the coinductive prover CIRC [26j. This allows for automatic 
reasoning about the equivalence of expressions specifying systems. 
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